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(SHER) 


“Though what I’m saying is perhaps not new, I have 
felt it quite vividly on this new occasion.” 


J. W. Goethe, in a letter from Naples, 17 May 1787 


Foreword 


The present textbook is my best effort to write a lively, problem-oriented and under- 
standable introduction to classical modern algebra. Besides careful exposition, my 
goals were to lead the reader right away to interesting subject matter and to assume 
no more background than that provided by a first course in linear algebra. 

In keeping with these goals, the exposition is by and large geared toward certain 
motivating problems; relevant conceptual tools are introduced gradually as needed. 
This way of doing things seems more likely to hold the reader’s attention than a 
more or less systematic stringing together of theorems and proofs. The pace is more 
leisurely and gentle in the beginning, later faster and less cautious, so the book lends 
itself to self-study. 

This first volume, primarily about fields and Galois theory, in order to deal with 
the latter introduces just the necessary amount of group theory. It also covers basic 
applications to number theory, ring extensions and algebraic geometry. I have found 
it advantageous for various reasons to bring into play early on the notion of the 
algebraic closure of a field. Naturally, Galois’ beautiful results on solvable groups 
of prime degree could not be left out, nor could Dedekind’s Galois-theoretical arith- 
metic reduction principle. Infinite Galois extensions are not neglected either. Finally, 
it seemed appropriate to include the fundamentals of transcendental extensions. 

At the end of the volume there is a collection of exercises, interspersed with 
remarks that enrich the text. The problems chosen are of widely varying degrees of 
difficulty, but very many of them are accompanied by hints — sometimes amounting 
to an outline of the solution— and in any case there are no outright riddles. These 
exercises are of course meant to allow readers to practice their grasp of the material, 
but they serve another important purpose as well: precisely because the main text 
was kept short and to the point, without lots of side-results, the appendix will give 
the reader a better idea of the wealth of consequences and applications derived from 
the theory. 

The linear algebra facts used, when not totally elementary, are accompanied by 
references to my Lineare Algebra, now published by Spektrum Akademischer Verlag 
and abbreviated LA I and LA II. This has not been translated, but equivalent spots in 
other linear algebra textbooks are not hard to find. Theorems and lesser results are 
numbered within each chapter in sequence, the latter being marked F1, F2, ...— the 
F is inherited from the German word Feststellung. Allusions to historical matters 
are made only infrequently (but certainly not at random). When a theorem or other 


vi Foreword 


result bears the name of a mathematician, this is sometimes a matter of tradition 
more than of accurate historical origination. 

The first German edition of this book appeared in 1987. I thank my colleagues 
who, already back at the writing stage, favored it with their interest and gave me 
encouragement — none more than the late H.-J. Nastold, with whom I had many 
fruitful conversations, W. Liitkebohmert, who once remarked that there was no 
suitable textbook for the German Algebra I course, O. Willhéft, who suggested 
several good problems, and H. Schulze-Relau and H. Epkenhans, whose critical 
perusal of large portions of the manuscript was a great help. The second (1991) and 
third (1995) editions benefited from the remarks of numerous readers, to whom I 
am likewise thankful, in particular R. Alfes, H. Coers, H. Daldrop and R. Schopohl. 
The response and comments on the part of students were also highly motivating. 
Special thanks are due to the publisher BI-Wissenschaftsverlag (later acquired by 
Spektrum) and its editor H. Engesser, who got me going in the first place. 

The publication of this English version gives me great pleasure. I’m grateful 
to Springer-Verlag New York and its mathematics editor Mark Spencer, for their 
support and competent handling of the project. And not least for seeing to it that 
the translation be done by Silvio Levy: I have observed the progress of his task with 
increasing appreciation and have incorporated many of the changes he suggested, in 
a process of collaboration that led to noticeable improvements. Further perfecting 
is of course possible, and readers’ suggestions and criticism will continue to be 
welcome and relevant for future reprints. 


Miinster, July 2005 Falko Lorenz 


Contents 


Horeword ice: feo lates fea wie Ge i PS Sek Se eee oe ee eS v 
1 = Constructibility with Ruler and Compass ......................... 1 
2 Algebraic Extensions ............. 0.0... 15 
3 Simple Extensions ............. 0.0... ee 21 
4 Fundamentals of Divisibility..................... 0.0.00. cece 33 
5 Prime Factorization in Polynomial Rings. Gauss’s Theorem......... 45 
6 Polynomial Splitting Fields ....................0.00000.0 0.0.2 55 
7  Separable Extensions ............. 0.0.0... eee eee ees 65 
8 Galois Extensions ............. 0.0... cece ees 75 
9 Finite Fields, Cyclic Groups and Roots of Unity ................... 83 
10 Group Actions iii. g i ee te lene bode epee ee eee ele 93 
11 Applications of Galois Theory to Cyclotomic Fields ................ 103 
12 Further Steps into Galois Theory ..................... 00. eee ee eee 115 
13.) Normand Trace ................ 0... cece ccc ccc eee ees 133 
14 Binomial Equations ................... 0... eee eee 143 
15 Solvability of Equations.................. 0... cece eee eee 165 
16 Integral Ring Extensions ................... 00... cece cece eee eee 191 
17 The Transcendence of 7 .............. 0... cece eect eens 203 
18 Transcendental Field Extensions ....................... 0.0 eee eee 209 
19 Hilbert’s Nullstellensatz................. 0... 217 
Appendix: Problems and Remarks ....................... 0000s eee eee 231 
Index:of Notations s..:.3.2 -eseesa6 hiv yes a ee Ee pa ee ee be ee eed 283 


1 


Constructibility with Ruler and Compass 


1. In school one sometimes learns to solve problems where a certain geometric 
figure must be constructed from given data. Such construction problems can be 
quite difficult and afford a real challenge to the student’s intelligence and ingenuity. 

If you have tried long and hard to solve a certain construction problem, to no 
avail, you might then wonder whether the required construction can be carried out at 
all. Whether a construction exists is a fundamental question: so much so that there 
are some construction problems that had already been entertained by the ancient 
Greeks, and yet remained unsolved for two thousand years and more. 

For example, nobody has ever been able to state a procedure capable of dividing 
an arbitrary angle into three equal angles, using ruler and compass. Now, of course 
construction problems range widely in degree of difficulty: think of the similar- 
looking problem of dividing an arbitrary segment into three equal parts using ruler 
and compass — not totally trivial, but after some thought just about anyone can 
carry out the construction. The problem of constructing a regular pentagon is also 
solvable, but already somewhat more complicated. So it is certainly understandable 
that even a construction problem that has eluded would-be solvers for a long time 
should leave room for hoping that success might yet be achieved through greater 
ingenuity. Perhaps, then, the question of whether a particular construction with ruler 
and compass is possible is not one that comes to mind immediately. 

Even if someone asks this question of principle, it is not clear a priori that there 
is a promising way to tackle it. Yet there is, as the development of algebra since 
Gauss (1777-1855) has shown. I would like to explain now, at the beginning of our 
introduction to algebra, how one can arrive at broad statements about the general 
constructibility problem, by translating this geometric problem into an algebraic one. 
As we elaborate on this, we will have the chance to motivate quite naturally certain 
fundamental algebraic concepts. Moreover the subsequent treatment of the derived 
algebraic problem will require many of the tools usually treated in an Algebra I 
course. This procedure has the advantage that one starts from a concrete and easily 
understood question and keeps the goal of solving it in mind as one goes along. 

Let it be said, however, that the problem of constructibility with ruler and com- 
pass by no means played a central role in the development of algebra. In this 
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regard the problem of solving algebraic equations by means of radicals was surely 
more significant, not to mention other motivations and stimuli coming from outside 
algebra — from number theory and analysis, for example. Incidentally, in due time 
we will make precise the problem of solubility of equations by radicals and keep it 
in view as the exposition unfolds. 


2. First let’s describe properly what is to be understood by constructibility with 
ruler and compass. For this we start from the plane R? of elementary geometry. A 
construction problem asks whether a certain point P of the plane can be constructed 
with ruler and compass, starting from a given initial set / of points. Thus, let a 
subset M of R? be given (we may as well assume it has at least two points). Then 
look at the set 


AM = {P eR? | P is constructible from M with ruler and compass}, 
to be defined more precisely as follows. Let 
Li(M) = set of straight lines joining two distinct points of WV, 


Ci(M) = set of circles whose center belongs to M 
and whose radius equals the distance between two points of M. 


Then consider the following elementary steps for the construction of “new” points: 
(i) intersecting two distinct lines in Li(/); 
(ii) intersecting a line in Li(/) with a circle in Ci(M); 

(iii) intersecting two distinct circles in Ci(/). 

Let M’ be the union of M with the set of points obtained by the application of 
one of these steps. The points of R* that can be obtained by repeated application 
of steps (i)-(iii), starting from M and replacing M by M’ each time, are said to be 
constructible from M with ruler and compass. They form the set AM. 

We just mention right now four well known constructibility problems that were 
posed already by the ancient Greeks. 


a: Trisection of the angle. Q' 


Given an angle of measure ¢, construct an angle of 
measure g/3 with ruler and compass. We regard 
the given angle as determined by its vertex S and YX 
points Q, Q’ on each of its sides; one may as well 

assume that Q and Q’ are equidistant from S. Let 

X be the point indicated in the figure. The question 0/3 

then is whether ¥ € A{S, O, Q’}. 


B: Doubling of the cube (Delian problem). 


Given a cube of side length a, find a cube of twice the volume. The side length x of 
the desired cube satisfies = 2a3, so x =aJ/2. Thus let P, Q,X be points on the 
real line such that PO =a and PX =a V2; the question is whether Y € A{P, Q}. 
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Given a circle, construct a square of the same surface area. The given circle of 
radius r is determined by points P and Q a distance r = PQ apart. The side length 
x of the desired square must satisfy x* = 2r?, so x =r./7. What must be decided, 
then, is whether some point X such that PY = r./z belongs to A{P, O}. 


6: Construction of a regular n-gon (n-section of the circle). 


As before, we think of the circle as being given by x 

two points P and Q, the center and a point on the 

circumference. Let X be the point shown on the 

right. For what natural numbers n does X lie in 

A{P,Q}? S 
This is the case, for example, for m = 6 (and 

therefore for all numbers of the form n = 3-2”): 

it is enough to draw a circle of radius PO with 

center QO. By successively repeating the procedure 

with the newly found points one gets the well known rosette: 


3. To make the problem of constructibility with ruler and compass accessible, one 
must first “‘algebraize” it. To that end it is useful to employ the identification 


R*=C, 


that is, to regard points in the plane as complex numbers, and so take advantage of 
the possibility not only of (vector) addition but also of multiplication. Assuming 
the basic properties of the field C of complex numbers, the problem of dividing the 
circle into n parts (see 6 above) amounts to the following question: Is it the case 
that 

gre WO. 1)? 


The next statement points out that the fundamental algebraic operations of C 
can be described constructively. 


F1. Let M be any subset of C containing the numbers 0 and 1. Then: 
(1) 7€ AM; 
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(2) zEAM > ZEAM; 

(3) zEAM => Rez, Imze AM; 

(4) ze AM > -zeE AM; 

(5) 21,22€AM > 714+2€ AM; 

(6) 271,22€AM > 71272€ AM; 

(7) zE AM, 240 => 1/z EAM. 
Proof. (1) The line connecting 0 and 1, that is, the real line R, belongs to Li(M) by 
definition. Intersecting R with the unit circle, which belongs to Ci(M/), we see that 


—1 eM. If we now construct the perpendicular bisector of the interval [—1, 1] in 
the well-known way and intersect it with the unit circle, we obtaini ¢ AM. 


(2) Drop a perpendicular from z to R. From the foot of this perpendicular, say a, 
draw a circle whose radius is the distance from a to z. Its second intersection with 
the straight line through z anda gives 7¢€ AM. 


(3) As just verified, we have a = Rez € AM. To obtain b = Imz, draw the 
perpendicular to the imaginary axis through z, and then transfer to R the absolute 
value of the foot bi of the perpendicular. 


(4) Intersect the line through 0 and z with the circle of radius |z| and center 0. 


(5) Intersect the circle of center z,; and radius |z2| with the circle of center z. and 
radius |z;|. One of the intersections is the vertex z; + Z2 of the parallelogram 
determined by 72, Z2. 


(6) If z] =a, + ib; and z2 = a2 + ib2 we have 

2122 = (a,d2 — bbz) + (ayb2 +.a2b;)i. 
Now 21,22 € AM implies a;,b;,a2,b2 € AM, by (3). If we assume the claim is 
true for real numbers, it will also be true for arbitrary complex numbers, because of 
(4) and (5). Therefore we must prove that given real numbers r; and r2, 


1,72 EAM => rr2€ AM. 


Clearly one can assume 7;, 12 > 0. To complete the proof, consider this diagram: 
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Here z is the appropriate intersection of the line through 0 and 1 +7 with the 
circle of radius r and center 0, and the dashed line indicates a parallel to the line 
through 1 and z. By similarity of triangles we have x:r2 = 1r;:1, and therefore 
X =11rz. Since x lies in A.M, this proves the claim. 


(7) Since z~! = Z-(zzZ)~!, it suffices in view of the earlier parts to show that if 
r > 0 lies in A.M, so does r~!. To do this we refer to the following diagram: 


r 


By similarity of triangles, r:1 = 1:x, and this proves the claim since x € AM. 


As a consequence of Fl we will explicitly state again: 


F2. Let M be a subset of C containing the points 0 and 1. Then A.M is a subfield of 
C. It is called the field of numbers constructible from M. 


In particular, @ C A.M, since @ is the smallest subfield of C. Also the set 
{a+ bi | a,b € Q}— which incidentally is also a subfield of A.M —is contained 
in AM. But the field A.M is substantially larger: 


F3. The field A.M is quadratically closed, that is, for every z € C we have 
(8) ze AM => Vze AM, 


where ./Z represents any complex number w with w? =z . 


Proof. Suppose w? = z = re!®. Letting ./r be the positive square root of r € R, 


we have w = +./r e’?/?. Since it is always possible to bisect an angle with ruler 
and compass, it is enough in order to prove (8) to show that for any r > 0 in AM, 
the square root ./r is also in AM. To do this we raise the perpendicular to the 
segment [—1,7] through 0 and intersect it with the semicircle constructed over the 
same segment, to obtain a point v: 


Vv 
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Then x = |v| belongs to AM. By Thales’ Theorem the triangle with vertices 
—l,v,r has a right angle; applying the formula for the altitude of a right triangle 
we get x? = 1-r, and so x = \/r. 


4. Now let’s return to the statement of F2, which says that AM is a field containing 
Q as a subfield. Trivially, 


(9) M CAM. 


Thus the field E := AM also contains all numbers that can be obtained from k := Q 
and from M by means of arithmetic operations in EL. This serves as the motivation 
for a simple but fundamental definition: 


Definition 1. Let E be a field and k a subfield of E. We say that E is an extension 
of k. Let A be any subset of FE. Set 


k(A) :=_ intersection of all subfields F of E such that k C F and AC F. 


We call k(A) the subfield of EF’ generated by A over k, and we also say that k(A) 
arises from k by adjoining to k the elements of A. (The corresponding noun is 
adjunction.) Clearly, k(A) is the smallest subfield of E containing k and A. In the 
case of a finite set A = {a1,...,@m} we also denote the field k(A) by 


Kk (1, Q2,...,Qm). 


Example. Take E = C,k = Q, A= {i}. We claim that 
Q(i) = {a+ bi |a,b € Q}. 


Proof. Let Fo be the set on the right-hand side. Then @ C Fo and {i} C Fo no 
matter what. Since Q(7) is a subfield of EF, we also have Fo C Q(Z). To prove the 
claim we must show that Fo is a subfield of C. Clearly, Fo is closed under addition 
and multiplication. There remains to show that if z =a+bi 40 lies in Fo, so does 


z—!, But 
= =( z)7} a—ib a 5 b 
zZ =2Z(2zz) = —— = = - I Ss, 
a2+b2 a2+b2 — a?+b? 


so z~! does lie in Fo. 

Warning. In the situation of Definition 1 it is not generally true that 
kK(a@) = {ao +aya+-+-+ana” | aj ek, n= O}. 

We will return to this point in Chapter 3. 


Now let’s return to the earlier situation. We know that A is an extension of 
Q containing M. Thus it contains the well-defined subfield Q(M) obtained from 
Q by adjunction of M. We set M = {Z| z © M} and consider the subfield 


(10) K:=Q(MUM) 
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of A.M obtained from Q by adjunction of the set A := MU M (because of (2) and 
(9), we know that A Cc AM). Since M C K CAM we obviously have 


(11) AM =AK. 


In other words, when considering the set of numbers constructible from a set, the 
initial set M can always be replaced by the field given by (10). By the way, the 
field K is mapped to itself under complex conjugation: 


(12) K=K. 


This is clear from (10), but we will justify it in detail for pedagogical purposes: By 
definition, K = Q@(A), and soQ =Q@C K and A= AC K. By Definition 1, then, 
Q(A) © K, that is, K C K. By complex conjugation we then get K C K = K. 
(Where have we used the fact that the complex conjugate of a subfield of C is also 
one?) 
F4, Let K be any subfield of C with K = K. 

(a) If z is the intersection of two distinct lines in Li(K), then z € K. 

(b) If z is an intersection of a line in Li(K) with a circle in Ci(K), then 


(*) there exists w € C with w? € K andz € K(w). 


(c) [fz is an intersection of two distinct circles in Ci(K), condition () again holds. 


These statements simply reflect the well-known fact that the analytic counter- 
part of the elementary construction steps (i)—-(iii) can only lead to solving linear 
or quadratic equations. Therefore we postpone the proof of F4 and instead derive 
from F4 certain consequences of great import to the constructibility problem. First 
we equip ourselves with appropriate terminology: 


Definition 2. Let E be an extension of the field K. 
(a) We say that E arises from K by adjoining a square root if there exists w € E 
such that 
w>eK and E=K(w). 
We call w a square root of the element v := w* of K, and we write w = /¥. 


(b) We say that F arises from K by successively adjoining square roots if there 
is achain K = Ko C K; C::-C Ky, = E of subfields K; of E where each 
K; is obtained from K;_-, by adjoining a square root. 


Examples. 1. E = Q(,/2) is obtained from Q by adjoining a square root. 


2. E = Q(z), where z = e?7*/3, is obtained from @ by adjoining a square root. 


For since z = —4 + si V3, we have @(z) = Q(V-3). 
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3. E = Q(e?7"/5) is obtained from @ by successively adjoining square roots. 
This is not immediately obvious, but can be seen as follows: The complex 
number w = e27"/5 satisfies wt* + w3 +w2+w+1l= 0, since w> = 1 and 
w # 1. Dividing by w? yields 


(13) wetw'+1t+w'+w?=0. 
Now set z:= w+w!; then (13) becomes, via (w+ w!)? = w27+24+w?, 
(14) z74+z-1=0. 


This equation has the solutions z = —4 ae 55 . The field K; := Q(z) therefore 
satisfies 


K, = Q(V5). 


But E = K,(w), and w obviously satisfies the quadratic equation 
(15) w?—-zw+1=0, 


whose coefficients lie in K,. Thus, as can be seen from the quadratic formula, 
E is obtained from K, by adjoining a square root of z? —4, which lies in Ky. 


The following theorem fulfills our first goal, the reduction of the geometric 
problem of constructibility with ruler and compass to a purely algebraic problem. 


Theorem 1. Suppose M €C contains 0 and 1. Set 
K:=Q(MUM). 


For a given z € C, the following statements are equivalent: 
i) z € AM, that is, z is constructible from M with ruler and compass. 


(ii) z lies in a subfield E of C obtainable from K by successively adjoining square 
roots. 


Proof. (ii) => (i): By assumption, there exists a finite chain of subfields of C, say 
K=Kj)CK,C::-CKkyn =E, 


satisfying K; = Kj—1(w;) with w? € Kj, for each 7, and also z € E. We also 
know that Ko = K CAM. Now consider the field K; = K(w 1), with w? e K. By 
F3, w, lies in A.M because w? does. Since A.M is a field, we have K(w;) CAM, 
that is, K; C AM. Analogously we get K2 C A.M, and so on until we finally get 
E= Km CAM. Since z € E we have ze AM. 


(i) => (ii): We first consider a z € C arising from M by applying only one of the 
elementary construction steps (i), (ii), (iii). Now we make use of F4. We conclude 
that z € K in the case of step (i); in cases (ii) and (iii) we get z € K(w), where 
w € C is such that w? lies in K. We claim that in each case z lies in a subfield 
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K’ of C obtained from K by successively adjoining square roots and satisfying the 
condition 


(16) K’= K’. 
In case (i) this is clear; see (12). In cases (ii) and (ili) consider the field 
(17) K' = K(w, 0) = K(w)(). 


It clearly satisfies condition (16). Since w? € K, we have ®* = w2e K= KC K(w), 

so by (17) K’ is indeed obtained from K by successively adjoining square roots. 
For an arbitrary z € A.M the assertion follows by induction on the number of 

elementary steps needed to construct z. 


Now that Theorem | has accomplished the desired algebraization of our problem, 
the four classical constructibility problems listed earlier can also be reformulated 
algebraically: 

Doubling of the cube: Does */2 lie in a subfield of C obtainable from @ through 
the successive adjunction of square roots? 

Quadrature of the circle: Does the number z lie in a subfield of C that can be 
obtained from Q by successively adjoining square roots? 

Construction of a regular n-gon: For what natural numbers n is the complex 
number e?7!/” contained in a subfield of C obtainable from Q by the successive 
adjunction of square roots? (By Example 3 after Definition 2, this is certainly the 
case form = 5: thus a regular pentagon is constructible with ruler and compass. You 
are encouraged to derive a practical construction from the calculation in Example 3; 
it is not hard to do.) 

Angle trisection: Let g be any real number. Is it the case that the complex 
number e’?/3 always lies in a subfield of C obtained from the field 


(18) K = Q(e'”) 

by successively adjoining square roots? (Note: For z:= e'? we have 7 =e? =z" !, 
so Q(z,Z) = Q(z), and (18) does indeed represent the right ground field for the 
purposes of Theorem 1.) 


5. We now carry out the proof of F4. We start with an arbitrary subfield K of C 
satisfying K = K. 


(a) An arbitrary line g in R? = C is given by an equation 
(19) g = {29 + tz, |t € R}, 
where Zo, 2; € C and z; 4 0. If g € Li(K) we can assume that zo, z; € K. Now 


suppose that g’ = {zy + ¢’z) | t! € R}, with 25,2, € K, is another line in Li(K), 
distinct from g, and that z € gM g’. There exist uniquely determined real numbers 
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t,t’ such that z = z9 + tz; = 2) + t'z,. Looking separately at real and imaginary 
parts, we see that (¢, t’) is the unique solution of the system of linear equations 


tx; —t'x} = XxX9—Xo, 
tyi—t'y, = Yo-Yo. 


Multiplying the second equation by i we get a system of linear equations over K. 
It follows that t,t’ € K andsoze K. 


(b) An arbitrary circle c in R* = C is given by 
(20) c= {z|(z-a)(z—a) =r°}, 


with a € C and r > 0 real. Suppose ¢ € Ci(K). Then a € K. Since ¢ also contains 
an element b from K, and since r? = (b—a)(b—@) and K = K, we have r? € K. 
Now let g € Li(K) be given by (19) and suppose z € gM c. Then there exists f € R 
with z = Zo + fz, and, in view of (20), 


(Zo + 2z,;t—a)(Zo +7,t-a) = r?, 
Multiplying out and dividing by |z;|*, we get an equation 
+ pt+q=0, 


with p andq in K. Then w:=t+ +p satisfies w* € K, and since z = zy + tz, and 
K(t) = K(w) we have z € K(w). 
(c) Now let two distinct circles c, and cz be given and suppose z € ¢; M cz. Then z 
satisfies a system of equations of the form 

(z—a)(Z—4@) =r’, 


21 : 
oe (2 —b)(Z—b) =”, 


with a,b,r?,s? € K anda #b. Subtracting one equation from the other yields 


(22) z(b—@)+Z(b—a) =c, 
with c =r? —s?—aa+bbe K. Solving equation (22) for Z and substituting the 
resulting value into the first line of (21), we get for z a quadratic equation with 
coefficients in K. The assertion follows immediately. 


6. The algebraic translation of the constructibility problem (Theorem 1) thus leads 
us to a more detailed study of the extensions of a given field K. In this forthcoming 
investigation the following statement is both simple and fundamental: 


FS (Dedekind). Let K be a field and E an extension of K. Then E can be regarded 
as a vector space over K. 
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Proof. This is clear: we consider on EF the existing addition operation and a scalar 
multiplication K x EK — E defined simply by restricting the existing multiplication 
map EF x E —> E of the field £. With these operations EF of course obeys all the 
axioms of a K-vector space. (Incidentally, by considering on the K-vector space 
F thus obtained the original multiplication operation, we can make EF into a K- 
algebra; for this concept see LA I, p. 87.) 


By regarding extensions of a field K as K-vector spaces, one gains access to 
the powerful methods of linear algebra, which have demonstrated their fruitfulness 
over and over in many different areas of mathematics and applications. 


Definition 3. If E is an extension of the field K, we denote by 
[E: K] 


the dimension of the K-vector space E. Instead of [E: K] we can also write E: K. 
This number is called the degree of E over K. 


Examples. (1) As an R-vector space, C = R?, so C:R = 2. 
(2) We have Q(7):@ = 2; see the example after Definition 1 and observe that 
i€Q. 
(3) We will see in Chapter 2 that Q(4/2):Q = 3. 


(4) Because R is uncountable, the degree R:Q cannot be finite. 


The usefulness of the viewpoint introduced in F5 becomes apparent already from 
the next statement: 


F6. Let E be an extension of the field K, and suppose that 1+ 140 in K. Then 
these two statements are equivalent: 

Gi) E:K=2. 

(ii) E is obtained from K by adjoining a square root that is not already in K. 


Proof. Suppose (i) holds, and let a be an element of F not belonging to K. Since 
E:K = 2, the set {1,a@} is necessarily a K-basis of E. In particular there is a 
relation of the form 


a? + pa+ql=0, with p.géK. 


For w:=a+ +p we then have w? = +P? -4 e€ K. Since E = K(a) = K(w) this 
implies (ii). 

Suppose, conversely, that E = K(w) with w? =: d € K and w ¢ K. Clearly 
E’ := {a+ bw |a,b € K} is a subring of E containing K. To prove (i) therefore 
we just have to show that for every a+ bw #0 in E’ the inverse (a + bw)! also 
belongs to E’. This follows from 


(a+bw)(a—bw) =a*—b*d eK, 


because we know (from w = /d ¢ K) that a2—b?d £0. 
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Thanks to F6 we can recast Theorem 1 as follows: 


Theorem I’. As before, suppose M CC contains 0 and 1, and set K:=Q(MUM). 
Then these two statements are equivalent: 


Gi) ze AM. 
(ii) There is a finite chain K = Ko C Kk, C--: © Kym of subfields of C such that 
Zz € Ky and 
Kj: Kji-1 =2 forl<i<m. 


This result suggests that we should study the relationship between the degrees 
of the extensions in the diagram 
E 
F 


K 


(23) 


whose meaning is that F' is an extension of K and £ is an extension of F. 


F7 (Degree formula). Let E be an extension of K and let F be a subfield of E 
containing K. Then 


(24) [E:K|]=[E: F]-[F: K]. 
Proof. If E: K is finite, so are E: F and F: K, of course. Now assume 
(25) F:K=m and E:F=n, 


with m and n natural numbers. We show that F': K is also finite and satisfies 
equation (24). Indeed, by (25) there is an isomorphism F ~ K™ of K-vector spaces 
and an isomorphism E ~ F” of F-vector spaces. This results in an isomorphism 


Ew F"~(k")= k™ 


of K-vector spaces. It follows that E : K =mn, so equation (24) holds. The essential 
content of (24) is thus proved. (Incidentally, it is clear how to modify the argument 
in case any of the degrees are infinite, so as to prove (24) regarded as an equality 


between cardinals.) But in addition we establish the following: If a1,..., 0 form 
a basis of F over K and f,..., 6, form a basis of E over F, the elements 
(26) (a; Bj )1<i<m, 1<j<n 


form a basis of the K-vector space £. For any a € EF can be written in the form 
a = )); 6; Bj with coefficients bj; € F, which in turn can be written as bj = )); aij Qi 
with aj; € K; it follows that a = 7; (S; aij) Bj = »i,; 4% Bj, so the elements 
in (26) span the K-vector space EF. Since E: K = mn they must form a basis. 
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We immediately get, as a consequence of F7 (and F6): 

F8. If E is obtained from K by successively adjoining square roots, 
E:K=2" for some integer m = 0. 

Remark. The converse of F8 is unfortunately not true; see Appendix, §5.7. 

At any rate, F8 gives us a necessary condition for a number to be constructible: 
F9. Let K be a subfield of C with K = K. If z € C is constructible from K, 
(27) K(z):K is a power of 2. 


Proof. Take z € AK. By Theorem 1, z lies in an extension E of K that can be 
obtained from K by successively adjoining square roots. By F8, E: K = 2” is 
a power of 2. Since z € E we have K(z) C E. Because of the degree formula 
[E: K]=[E: K(z)]-[K(z): K], the integer K(z): K is a power of 2, since it divides 
E:K. 


As remarked, the converse of F9 is not generally true. Only in Chapter 11 will 
we be able to explain how condition (27) can be modified to give a necessary and 
sufficient condition for the constructibility of a number. 

Regarding the four classical constructibility problems listed near the beginning 
of this chapter, F9 tells us that we should be investigating the following questions: 


a) Q(e!?/3):Q(e?) =? 
B) Q(4/2):0 =? 
y) Q(z):Q =? 
6) Q(e27/"):Q =? 
If we can show, for example, that Q(</2):Q@ = 3, this would prove that the 
problem of the doubling of the cube is insoluble with ruler and compass. 
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Algebraic Extensions 


1. Let K be a field and E an extension of K. One writes this assumption in short 
as 
Let E/K bea field extension, 


and the word “field” is often omitted when it can be inferred from the context. 


An element a of E is called algebraic over K if there exists a polynomial 
f(X) #0 in KX] such that 


f(a) = 0. 


If aw is not algebraic over K, we say that @ is transcendental over K. 


Remarks. (a) If K =Q and E =C, the elements of E algebraic over K are called 
simply algebraic numbers, and the elements of F transcendental over K are 
called transcendental numbers. Example: a := 2 is an algebraic number, 
since a is a root of the polynomial ¥?—2 € Q[X]. 

(b) The set of algebraic numbers is countable (since Q[X] is countable and any 
nonzero polynomial in Q[X] has finitely many roots in C). Therefore the set 
of transcendental numbers must be uncountable. To actually be able to exhibit 
a transcendental number is a different (and much harder) matter. 


Theorem 1. Let M be a subset of © containing 0 and 1. Any point z € AM is 
algebraic over K:= Q(M UM). 


The proof will be given later in this chapter. But first we quote a famous result: 
Theorem 2 (Lindemann 1882). The number x is transcendental. 
Corollary. The quadrature of the circle with ruler and compass is impossible. 


Proof. If it were possible, we would have a € AQ; by Theorem 1 then z would be 
algebraic, which by Lindemann’s Theorem is not the case. 


Lindemann’s Theorem can be proved using relatively elementary algebraic and 
analytic arguments, but the proof is on the whole quite intricate. We will go into it 
later on (Chapter 17). 
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2. Now we start our study of field theory with the following statement: 


F1. Let E/K be a field extension. Ifa € E is algebraic over K, then 
K(a): K < oo. 

Proof. Suppose there exists a nonzero polynomial 

(1) f(X) =X" + ay X" "| +--+ +.a9 € K[X] 


such that f(a) =0; we have assumed without loss of generality that f is normalized 
(has leading coefficient 1). There exists a unique homomorphism of K-algebras g 
from the polynomial ring K[X] into E such that p(X) = a (see page 21); its image 


R=imgCE 


consists precisely of those elements of F that can be written as polynomial ex- 
pressions g(a) in @ with coefficients in K. But in writing such an expression we 
immediately see from the relation 


(2) a” = —(dn_1a"! +++» +. aa +9) 
that only terms of degree less than v are needed, so in fact 
(3) R={eotcat-:-+¢,-10"! | oj € K}. 


Thus, as a vector space over K, the dimension of R is at most n. Since R, being 
a subring of £, has no zero-divisors, a simple argument (given a bit further down) 
shows that R is actually a field. It follows that K(@) C R (using the definition of 
K(qa)), and therefore that R = K(a). From (3) we then get 


(4) K(a) = {co Heya ++++ + Cpa"! | Gj € K}. 
In particular, 
(5) K(a):K <n. 


F2. Let R be an integral domain (that is, a commutative ring with no zero divisors 
and with | £0), and let K be a subfield of R. If R is finite-dimensional as a K-vector 
space, R is a field. 


Proof. For a given a #0 in R, consider the map 4: R— R given by multiplication 
by a, namely, 4(x) = ax for all x in R. Then / is an endomorphism (linear map) 
of the K-vector space R. Since R has no zero-divisors, / is injective. Because R is 
assumed finite-dimensional over K, it is also surjective. In particular, there exists 
b € R such that ab = 1. 


Remark. It can be proved in an analogous way that an integral domain that has 
finite cardinality is a field. 
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3. Let E/K be a field extension, and let a € E be algebraic over K. Consider 
on the K-vector space K(a) the endomorphism / defined by multiplication by a. 
The minimal polynomial of / is called the minimal polynomial of a over K, and we 
denote it by 

MiPox (a). 


This is the lowest-degree normalized polynomial in K[X] that has a as a zero. (That 
there can be only one such polynomial is clear: if f, g are both normalized and of 
degree n, the degree of f — g is less than n.) The degree of { = MiPog(K) is also 
called the degree of a over K, and is denoted by [a: K]. 


Example. Consider E = C, K = Q and a = e?7"/3. Then @ is a root of ¥?—1. But 
X¥3?—-1=(X—1)g(X), with g(X) = X¥7 + X¥ +1; since w #1, we have g(a) = 0. 
Let f = MiPox (a); we claim that f = g. Otherwise necessarily deg f < deg g, so 
f could only be of the form f(X) = X —a, which is impossible since a € R. 


F3. Let E/K be a field extension and let a € E be algebraic over K, of degree 
n:= [a: K]. The elements 


(6) Lor sang 
of E form a basis of K(a) over K. In particular, 
(7) K(a): K = [a: K] = deg MiPox (a). 


Proof. Let f(X) = X" +---+a1X +o the minimal polynomial of w over K. We 
know that 
K(a): K <n; 


n 


see (5) in the proof of F1. There remains to show that 1, a, a?,...,a"—! are linearly 


independent over K. Suppose there is a relation 


n-1 
(8) yaa’ =0 witha eK. 
i=0 


Set g(X):= aa c; X'. If some c; in (8) were nonzero, g(X) would be a nonzero 
polynomial in K[X] of degree less than 7 and vanishing at w. Contradiction! 


4. Let E/K be a field extension and assume a € E is algebraic over K. Is it the 
case that any 6 € K(q) is also algebraic over K? 


Definition. An extension E'/K is called algebraic if every element of E is algebraic 
over K. An extension E/K is called finite if E: K < oo. 


Remarks. C/R is a finite extension, since C:R = 2. The extension R/Q is not 
algebraic; see Remark (b) in Section 2.1. 


An extension E/K is called transcendental if it is not algebraic. 
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F4. [f an extension E'/K is finite, it is also algebraic; for each B € E the degree 
[B: K] is a divisor of E: K. 


Proof. Let E/K be finite of degree n. Given B € E, the n+ 1 elements 1, 6, 
B?,..., B” of the n-dimensional K-vector space E are linearly dependent. Therefore 
there exist d9,41,...,dy € K, not all zero, such that 


dol +a,B +-+-+a,p”" =0. 


Thus 6 is algebraic over K. By F3, [8: K] = K(B): K, and K(B): K is a divisor 
of E: K by the degree formula (Chapter 1, F7). 


We now can easily answer in the affirmative the question asked at the beginning 
of this section. 


F5. Let E/K be a field extension. If a € E is algebraic over K, the extension 
K(a)/K is algebraic. 


Proof. If a is algebraic over K, we know from F1 that K(a)/XK is finite. But every 
finite field extension is algebraic, by F4. 


Together, Fl and F4 afford the following criterion: 


F6. Let E/K be a field extension. An element a of E is algebraic over K if and only 
if K(a)/K is finite. 


Now it is a cinch to prove Theorem 1, which we can reformulate as follows: 


Theorem 1. Let M be a subset of C containing 0 and 1. Let K = Q(M UM). The 
field extension ).M/K is algebraic. 


Proof. Take z € A.M. From F9 of Chapter 1 we know that K(z): K < oo. Then F6 
says that z is algebraic over K. 


Remark. The converse of F4 is not true: Not every algebraic extension is finite. 
This will soon become obvious. In fact a counterexample comes up naturally in our 
context: If E = A{0, 1} is the field of all numbers constructible from {0, 1} with 
ruler and compass, the field extension E /@ is algebraic but not finite. (With what 
we know so far this is not very easy to prove, but it’s worth thinking about; see §2.5 
in the Appendix.) 


Among algebraic extensions, finite extensions can be characterized thus: 


F7. Let E/K be a field extension. The following conditions are equivalent: 


(i) There are elements a1,...,Qm of E, finite in number and algebraic over K, 
such that E = K(a,...,Qm). 


(ii) E/K is finite. 
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Proof. (ii) => (i) is clear; all we need to do is choose a basis @1,...,Q@m for E/K. 
Then we actually have E = Ka, +---+ Ka, and by F4 all the a; are algebraic 
over K. 

To show (i) = (ii) we use induction over m. For m = 0 there is nothing to 
prove. Assume that (i) holds for some m > 1 and set 


K’ = K(a, oS ,Am—1)- 


Then E = K’(am). Since @» is algebraic over K, it is a fortiori algebraic over the 
larger field K’. By Fl this implies FE: K’ < oo. But by the induction hypothesis, 
K'/K is finite. The degree formula (Chapter 1, F7) then implies that E/K is finite. 


5. Let E/K be a field extension. A subfield L of E containing K is called an 
intermediate field of the extension E/K. 


F8. Let E/K be a field extension. The subset 
F={aeE |a is algebraic over K} 


is an intermediate field of E/K. It is called the algebraic closure of K in E. In 
particular, the set of all algebraic numbers is a subfield of C. 


Proof. Take a, B € F. Consider the subfield K(a, 8) of E. By F7 the extension 
K(a, B)/K is finite (prove this again for practice). Now apply F4; all elements of 
K(a, B) are algebraic over K, so 


K(a, B) C F. 


The elements a+ 6, a—f, aB and 1/a (if a £ 0) lie in K(a, B), and thus also in 
F. So F really is a subfield of E. Clearly K C F, since any a € K is a zero of 
a polynomial ¥ — a € K[X] and therefore algebraic over K. This completes the 
proof. 


This proof qualifies as easy, but it’s only easy because we have the right notions 
at our disposal. Otherwise, would you be able to write down, at the drop of a hat, a 
nontrivial rational polynomial that vanishes at the sum of two numbers, given only 
rational polynomials vanishing at one and the other number respectively? 


F9 (Transitivity of algebraicness). Let L be an intermediate field of the extension 
E/K. If E/L and L/K are algebraic, so is E/K (and vice versa). 


Proof. Take 6 € E. By assumption f is algebraic over L. Let a, 1,...,Qy—1 
be the coefficients of MiPoz (8); then 6 is also algebraic over the subfield F := 
K(a0,@1,...,@n—1). By assumption all the a; are algebraic over K. Therefore we 
can apply F7 to conclude that F: K is finite. But F(6): F is also finite, by F6; 
therefore the degree formula gives 


F(B):K <o. 


Using F4 we see in particular that 6 is algebraic over K. 
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F10. Let E/K be a field extension and A a subset of E. If all elements of A are 
algebraic over K, the extension K(A)/K is algebraic. 


Proof. Clearly K(A) is the union of all subfields of the form K(M), where M 
ranges over finite subsets of A. By F7, each K(M)/K is finite and therefore also 
algebraic. Thus K(A) contains only elements algebraic over K. (Of course F10 
also follows directly from F8.) 


F11. Let E/K be a field extension, and L,, L2 intermediate fields of E/K. The field 
(9) LL. := Li(L2) = La(L1) 


is called the composite of L, and L2 in E. 
(a) If L1/K is algebraic, so is LyL2/Lp. 
(b) If L,/K is finite, so is L,L2/L2; moreover L,L2:Lz< Ly: K. 
(c) If L,/K and L,/K are algebraic, so is L,L4/K. 
(d) If L1/K and L2/K are finite, so is L1L2/K; if, moreover, the extension 
degrees n,; = L,: K andn2= L): K are relatively prime, we have L; L2: K = 
nyNn2. 


Proof. Part (a) follows from F10, taking (9) into account. Part (c) therefore also 
follows, thanks to F9. Let L;/K and L2/K be finite. Assuming (b) already proved, 
we see from the degree formula that 


(10) [1 L2:K = (L1L2:L2)(L2: K) < (11: K)(L2:K), 


which is the first part of (d). Again from the degree formula we obtain that Lj L2:K 
is divisible by 1; and by no. If m1, m2 are relatively prime, L;L2: K is divisible by 
n4N2, which together with (10) gives the second part of (d). 

There remains to prove (b). Consider the set R of all finite sums of products ab 
with a € L;,b € Lo. Clearly R is a subring of FE containing L; and Ly. It is also 
clear that any basis of L;/K generates R as an L2-vector space R, so in particular 
R:L,< 11: K. If £1: K < ow, this implies that R is a field (see F2). It follows 
that R = L,L>, which concludes the proof. 
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Simple Extensions 


1. We have seen that in considering constructibility questions one must investigate 
certain algebraic field extensions, for example Q(</2)/Q in the case of the doubling 
of the cube and Q(e?7‘/”) /Q in the case of the construction of a regular n-gon. How 
can such extensions be described? What can be said about their degree? 

More generally, let £/K be a field extension and take a € E. How can K(a)/K 
be described? 


Definition. A field extension L/K is called simple if there exists an element a of 
L such that L = K(a). Such an @ is called a primitive element of L/K. 


Let K be a field. The polynomial ring KX] is an algebra over K, or K-algebra 
(for the definition of this notion see LA I, p. 87). Now, if E/K is a field extension, 
E can also be regarded as a K-algebra. Let a be an element of E'. Because of the 
universal property of the polynomial ring there exists a unique homomorphism of 
K-algebras 


(1) go: K|X|— E_ with 9(X) =a, 
namely the substitution homomorphism given by 

(2) g(X) = oa X'r> aja =! g(a). 
We will denote the image of g by K[a]. We have 

(3) K[a] = {g(@) | g € K[X}}. 


Ka] is a subring of F, and indeed a subalgebra of the K-algebra F. 


Fl. Let E/K be a field extension and take a € E. The following statements are 
equivalent: (i) a is algebraic; (ii) K(a) = K[a]; (iii) K[a] is a field. 


Proof. (i) => (ii): If w is algebraic over K, the field K(a@) can be described as in 
equation (4) of Chapter 2, so K(a) = K[a]. The implication (ii) > (iii) is trivial. 
Finally, if (iii) is true, a (if nonzero) has an inverse in K[a]; that is, 


(ap +aya+---+ay,0”) = 1 for appropriate a; € K. 


22 3 Simple Extensions 


Hence a is algebraic over K. 


2. Let E/K be a field extension and a any element of E. In connection with the 
description of K[a] in (3), we compare K[a] with the polynomial ring K[X]. In 
K(X], 


(4) So aiX' => b;:X' implies a; =; for all i. 


In contrast with XY in K[X], the element w may satisfy a nontrivial relation in K[a], 
so that in general we cannot deduce from )> aja! = > bja' that a; = b; for all i. 
For g1, 2 € K[X] the equation g1(a@) = g2(q@) is equivalent to (g1 — g2)(a) = 0, 
so we must study the kernel 


(5) I = Iy:= {g € K[X]| g(@@) =0} 
of the homomorphism ¢ of (1) and (2). 
Definition. Let R be any ring with unity. A nonempty subset J of R is called a 
(two-sided) ideal of R if 
Gi) aabeI > a+belT, and 
Gi) ael,xEe R= xa, axel. 


Thus a subset of R is an ideal of R if and only if it is a subgroup of the additive 
group of R and is mapped into itself by multiplication, whether on the left or on 
the right, with any element of R. 


The set 7 in (5) is an example of an ideal, called the ideal of relations of a. In 
general: 


F2. Suppose gp: R > R' is a ring homomorphism (of rings with unity, so 91 = 1). 
Then 

kerg = {ae R| ga =0} 
is an ideal of R. Conversely, if I is an ideal in a ring R, there is a (canonical) 


surjective homomorphism of rings 1: R —> R such that ker x = I. 


Proof. The first assertion is clear. (But incidentally, the image of g is generally 
not an ideal of R’.) The proof of the second assertion results from the following 
construction: 


The quotient modulo I. Consider the relation ~ on R defined as follows: a ~ b 
means a—b € J. Clearly this is an equivalence relation on R. Instead of a ~ b one 
generally writes (following Gauss) 


(6) a=b mod I, 


read “a and b are congruent modulo J”. This terminology is felicitous (among other 
reasons because it stresses that the relation ~ depends on the ideal 7). Now consider 
the equivalence classes determined by ~ , for which we use the notations 


G={d e€R\|ad~as=at+l={at+ylyel}. 
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Thus in the notation of (6) we have 
(7) a=b <— > a=b mod I. 


The set d is called the residue class of a mod I. We denote the set of all residue 
classes modulo J by 


R/T, 
read “R modulo J” or “R quotient 7”. Now take the map 
mw: R— R/I 
(8) ate ed 


which assigns to each a in R the residue class of a mod J (that is, the ~-equivalence 
class containing a). 

We claim that R/J is a ring in a natural way; more precisely, there is a unique 
ring structure on R/I that makes x into a ring homomorphism. 

Proof: Given G,b € R/T, we have no choice but to specify 


(9) G@+b:=a+b, Gb:=ab. 


But we have to check that this sum and product are well defined —in other words, 
that a’=a mod J and b’=b mod J necessarily imply that a’+ b’= a+b mod J and 
a'b' = ab mod I. So suppose that a’ = a+ x and b’=b+y, with x, y¢ I. Then 


a+b’=at+b4+(x4+y) ab’ =ab4+(ay+bx+xy). 


Since J is an ideal of R, the expressions in parentheses both lie in 7, which proves 
the claim. 
To prove F2 we now just have to show that J = ker z. But this is clear, because 


m(a)=0 < a=0 < a=0mod/J <> ael. 


We call R/I the quotient ring of R by I, or modulo I. (“Residue-class ring” 
is an alternative name for “quotient ring”.) When no misunderstanding is likely, 
one can simply write R instead of R/IJ. The map z in (8) is called the quotient 
homomorphism under I or the canonical map from R onto R/T. 

The role of the quotient ring in the description of ring homomorphisms is the 
following: 


F3 (Fundamental Homomorphism Theorem). Let g : R — R’ be a homomorphism 
of rings. There exists a unique ring isomorphism 


w:R/kerg — img 


such that the diagram 


(10) Sy vA 


R/kerg Ba img 
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commutes. Here m denotes the quotient homomorphism and t the inclusion of im@ 
in R’. In particular, there is an isomorphism 


(11) R/kerg = img. 


Analogous statements hold regarding (instead of rings) K-algebras, modules over 
rings, abelian groups, arbitrary groups, etc. 


Proof. We have no choice but to define w by setting y(@) = g(a). Is w then well 
defined? Yes, because @ = b implies the existence of x € ker g such thata=b+x, 
whence we get the desired equality g(a) = (b+ x) = (6) + g(x) = v(d). 
Clearly yw is surjective. We still have to prove that ker yy = 0, that is, y is 
injective: but from w(@) = 0 we get y(a) = 0, hence a € ker gy, so dG = 0 as desired. 
This reasoning applies wholly analogously to any type of algebraic structure; 
in most cases it is clear which subsets actually occur as kernels of the type of 
homomorphism in question, so the construction of the quotient structure carries 
over. In the case of a group G kernels are subgroups U of G that satisfy Ux = xU 
for every x € G; these are called normal subgroups of G. 


Applying the Fundamental Homomorphism Theorem to the situation considered 
at the beginning of the chapter leads to the isomorphism of K-algebras 


(12) Kla] ~ K[X]/Iw. 


F4. Let E/K be a field extension. Givena € E the following statements are equiva- 
lent: 
(i) @ satisfies no algebraic relation, that is, f(a) = 0 with f € K[X] implies 
f=0. 
(ii) @ is transcendental over K. 
(iii) K[o] ~ K[X] as K-algebras. 
(iv) K[o] is not a field. 


Proof. (i) <=> (i) is clear. Suppose (i) holds. Then Jy = {0}, so K[a] ~ 
K[X]/{0} ~ K[X] by (12), which implies (iii). Using a degree argument we see 
that the group of units of K[X] is K*, the multiplicative group of K; therefore (iii) 
=> (iv). The implication (iv) = (ii) is already contained in F1. 


We now look at the case where a is algebraic over K. The ideal I, of relations 
of @ is then nontrivial; in particular, it contains f := MiPox (a). For an arbitrary 
ge kX], 


(13) g(a)=0 =f divides g in K[X]. 
We recall the proof of this well-known fact. Division with remainder yields 


(14) g=qf+t+r_ with q,r € K[X], degr <deg f. 
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So if g(a) = 0, one concludes by substituting w in (14) that r(@) = 0, which (since 
degr < deg f and f = MiPox(qa)) can only happen if r = 0. Therefore g =q/f. 
Thus, according to (13) the ideal J, has the form 


(15) Iu = {af |q € K[X]} = K[X]f. 
This motivates the next definition: 


Definition. Let R be a commutative ring with unity. Given a € R, denote by (a) 
the ideal Ra = {ca|c € R} in R. This is the principal ideal of R generated by a. 
Instead of x= y mod (a), it is common to write x =y mod a. The quotient R/(a) 
is also denoted by R/a. The ideal {0} = (0) is often simply written 0. 


We summarize the work so far: 


Theorem 1. Let L/K be a simple algebraic field extension and a a primitive element 
of L/K. Put f = MiPox (a). The substitution homomorphism corresponding to a, 


K[|X|— L= K(a) = Ka], 
gives rise to an isomorphism (of K-algebras) 
K[X]/f — L= K(@). 
In particular, given g € K[X], we have g(a) =0 => f\lg. 
In the situation of Theorem 1, the isomorphism 
(16) Ka) = K[X/f 


gives a good description of a simple algebraic field extension: it all boils down 
to computing in K[X] modulo f. Crucially, this description also provides a hint 
for how to generate simple algebraic extensions of a given field. We address this 
question now. 


3. So let K be an arbitrary field and f a polynomial in K[X] of degree n > 1. 
Consider the quotient algebra 


(17) Kp := K[X]/f 


over K and denote by z : gt> g the corresponding quotient homomorphism. Since 
deg f = 1, the homomorphism of K-algebras 


(18) K => Ky with awa 


is clearly injective. Through this homomorphism we can regard K as a subfield of 
Kp: K © Ky. Now let a:=2(X) = X be the residue class of X mod /. Then x 
is also the unique algebra homomorphism K[X]— Ky mapping X to a. Therefore 
m(g) = g(a) for all g € K[X]. Thus, since z(f) = 0, 


(19) f(a) =0. 
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Thus / has @ as a root in Ky. Even more: given any g € K[X], 


(20) g(a)=0 <> fg in K[X]. 

Also 

(21) Ky = K[o] = {g(a) | g € K[X}}. 

More precisely, we claim that 1,a,...,«"—! form a basis of the K-vector space Ky, 


so this space has dimension n. Proof: Any element of Ky is of the form g(a) with 
g € K[X]. Division with remainder lets us write g = gf +, and substituting a we 
get g(a) =r(a) with degr <n—1. There remains to show that 1,a,...,@”~! are 
linearly independent. A linear relation co + cya +°+++ Cn-1a"! = 0 with c¢; € K 
results in h(w) = 0 with h(Y) = co +c, X +++++Cn-1X" ! € K[X], which is only 
possible if i = 0, because f divides h (look at the degrees). 

Is Ky a field? We will show that this is so if and only if f is irreducible. 
Recall that a polynomial f € K[X’] is irreducible if deg f > 1 and any factorization 
f = fifo with fi, fo € K[X] implies that f, ¢ K or fp € K. An irreducible 
polynomial is also called a prime polynomial. The following result is fundamental 
and well-known: 


Theorem 2. Let f be an irreducible polynomial in K[X]. If f divides gh for 
g, he K[X], then f|g or f |h. 


Proof. This assertion, which you’re surely familiar with, will follow from general 
considerations in Chapter 4. Here we give an ad hoc argument: Suppose that f, g,h 
contradict the theorem. Division with remainder gives g =qf +r. Since f| gh we 
have f |rh, so we might as well assume that deg g < deg f to begin with. Among 
all triples /, g,/ contradicting the theorem choose one where deg g is minimal. 
Since f = qg +r with degr < deg g we first get f|rh, and since the degree of g 
is minimal and less than that of f we next get r = 0. Because / is irreducible it 
follows that g is a unit—a contradiction. 


Theorem 2 was first formulated by Simon Stevin in 1585; the analogous state- 
ment for the ring Z is already in the works of Euclid (ca. —330). 


F5. Kp = K[X]// is a field if and only if f is irreducible in K(X]. 


Proof. Let Ky be a field. If f = fi fo in K[X] we have fi (a) fo(a) = f(a) = 0 
and therefore /i(a) = 0 or f2(a) = 0. Because of (20), either fy or fi lies in K, 
so f is irreducible. 

Conversely, assume that f is irreducible. We already know that Ky is finite- 
dimensional over K; keeping in mind Chapter 2, F2, we then just have to show that 
Kf is an integral domain. So let zh = 0. Since Zh = gh = 0 we get f|gh; by 


Theorem 2 this implies f|g or f|h, which is to say g = 0 orh=0. 


F6. Let E/K be a field extension and suppose a € E is algebraic over K. Then 
f := MiPox(q) is an irreducible polynomial in K|X]. Conversely, a normalized 
irreducible polynomial in K[X] that vanishes at a must equal MiPox (a). 
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Proof. (i) Because of (16) we have K[X]/f ~ K(q@), so f is irreducible by F5. 


(ii) Any polynomial g such that g(a) = 0 is divisible by /. If g is irreducible and 
normalized, it must equal f because / is also a normalized polynomial of degree 
at least 1. 


Example. Let K = Q, E =R,a = 2, g(X) = X3—2. We wish to show that 
g = MiPog(a). Since g(a) = 0 all we have to do, thanks to F6, is show that g is 
irreducible. Suppose g = gi g2 were a nontrivial factorization of g in Q[X]. Then 
one or the other factor, say g;, has degree 1. Being linear, g, has a zero f in Q. It 
follows that g(B) = 0, so B? = 2. This contradicts the fact that </2 does not lie in 
Q. (This is something we assume known; later we will be able to show that XY 3_9 
is irreducible without resorting to this fact, but rather as an immediate consequence 
of Chapter 5, F10.) 

Taking into account equation (7) of Chapter 2, we immediately get the corollary 


Q(7/2):Q =3. 


As a first fruit of our algebraic study of the constructibility problem, we obtain 
from this and from Chapter 1, F9: 


Theorem 3. <2 is not constructible from {0,1} with ruler and compass. Conse- 
quently, the Delian problem of the doubling of the cube is also not soluble. 


Here is an important field-theoretical application of the results from this chapter: 


Theorem 4 (Kronecker). Every nonconstant polynomial f (X) over a field K has a 
root in some appropriate extension of K. 


Proof. Since deg f = 1, there must be an irreducible polynomial g dividing f 
(consider all nonconstant factors of f and take one of least degree). If an extension 
of K contains a root of g it will also serve for /; therefore we assume without loss 
of generality that f is irreducible. Then Ky = K[X]/f is a field, by F5. Up to 
isomorphism Ky is an extension of K, and the image a of X is a zero of f; see 
(18) and (19). 


Kronecker’s Theorem is unsurprising from the point of view of modern algebra, 
and its proof is simple. Nonetheless it does remove one of the criticisms leveled 
by Gauss at earlier justifications of the Fundamental Theorem of Algebra, which 
was that Euler and Lagrange simply started off from the premise that a nonconstant 
polynomial always has roots (somewhere) and then sought to prove that these roots 
must be in C. Gauss wrote: “How these magnitudes, which we cannot even begin to 
visualize — mere shadows of shadows — are to be added or multiplied is something 
that surely cannot be grasped with the clarity that mathematics always demands.” 


4. We now consider a simple transcendental extension L/K. Let a be a primitive 
element of L/K. Then a must be transcendental (by F5 in Chapter 2), and so F4 
yields 

Kia] ~ K[X]. 
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What can we say about L = K(q) itself? (By F4 we know Ka] is not a field, so 
Ka) # K{o].) 
For brevity we set R = K[a]. We claim that 


(22) L={B/y|B.yveR, vy #90}. 


Proof: Let Q be the subset of L defined by the right-hand side of (22). Clearly L 
is a subfield of L, and K[a] = R C Q. This implies K(a@) C Q, so we get L = Q. 


Definition. Let E be a field and R a subring of E. The field of fractions of R in 
E is the intersection L of all subfields of & that contain R. One can express L 
exactly as in (22), the justification being the same as above. 


Now, is any integral domain R a subring of a field? 


F7 (Fraction field). Let R be a integral domain. There exists a field F and an 
injective ring homomorphism t: R — F with the following property: Ifk: R— E 
is any injective ring homomorphism from R into a field E, there is a unique ring 
homomorphism i.: F — E such that }ot = x — in other words, making the following 
diagram commutative: 


| ren 5 


(23) | vA 


R 


Such a field F is called a fraction field of R. It is uniquely determined up to 
isomorphism: more precisely, if F’ is another fraction field and’: R — F' the 
corresponding map, there exists a unique isomorphism ) : F — F' such that 


(24) | F 


commutes. Moreover, F is the field of fractions of tR in F, in the sense of the 
preceding definition. 


(Another name for “fraction field” is “field of quotients”. This use of “quotient” 
is not the same as in the expression “quotient ring” defined earlier; in a field of 
quotients the elements of the field are themselves the quotients.) 


Remark. In view of the uniqueness statement in F7, we talk from now on about 
the fraction field of R; we denote it by Frac R. For simplicity we will generally 
assume that R C Frac R, which entails no loss of generality. We then have 


Frac R= {a/b|a,be R, b #0}. 


The reason we were so punctilious in the statement of F7 is that this is a key example 
of solving a universal problem of the kind that one often comes across in algebra 
(and elsewhere). 
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Before proving F7, we state one more result: 


F8. Let R be a integral domain. If 1: R — K is an injective ring homomorphism 
from R into a field K, the field F of fractions of LR in K is a fraction field of R. 


Proof. Let k: R — E be an injective ring homomorphism of R into a field E. We 
define 1: F > E by setting 


A(ta/tb) = ka/xb. 


It is easy to check that A is well defined. It is also clear that 4 is a ring homomor- 
phism, and that in fact it’s the only one for which diagram (23) commutes. 


Proof of F'7. We first show the uniqueness statement. By assumption there exist 
homomorphisms A: F — F’ and 4’: F’ > F with Aor =0' and dol’ = 1. It 
follows that 4/001 =A’ ou’ =1, and thus, because of the uniqueness requirement, 
A’oh = idg; analogously we have 4 0X’ = idg. Therefore A is an isomorphism. 

In view of F8 what is left to show is that there is a field K and an injective ring 
homomorphism 1: R— K. For this consider the set M = {(a,b)|a,be R, b #0} 
with the relation ~ defined by 


(a,b)~ (c,d) means ad =be. 


It is easy to prove that this is an equivalence relation; let K = M/~ the set of 
equivalence classes. Denote the class of (a,b) € M by [a/b]. Define addition and 
multiplication on K as follows: 


[a/b] + [c/d] =|[(ad + bc)/bd],  [a/b]-[c/d] = |ac/bd]. 


Checking that these operations are well defined is left to the reader. It is easy to 
see that with these operations K becomes a commutative ring with unity; the zero 
element is [0/1] and the unity is [1/1]. The mapz: R > K defined by ¢(a) = [a/1] is 
a homomorphism. By definition, [a/b] = 0 = [0/1] if and only if a =0. In particular, 
t is injective. In addition, every [a/b] 40 in K has a multiplicative inverse, namely 
[b/a]. Therefore K is a field. 


The classical example of the construction above is the field of rational numbers 
Q = Frac Z. 
Other key examples arise as follows: 
Definition. Let K be a field and K[X] the polynomial ring over K. The field 
K(X) := Frac K[X] 
is called the field of rational functions in one variable over K. It satisfies 


L(X) 


BOE es 


pee xixy exo}: 


30 3 Simple Extensions 


thus every “rational function” — which is to say, every element of K(X)—is a 
quotient of polynomials. (If K is infinite, the elements of K(X) can really be 
seen as rational functions in the usual sense of sending each point in the domain of 
definition— here K minus some points — to its image.) 


F9. Let E/K be a field extension and take a € E. If o is transcendental over K, 
there is a natural isomorphism 


(25) K(a) ~ K(X) 
of fields (and of K-algebras). Conversely, if (25) holds, a is transcendental over K. 


Proof. If @ is transcendental over K, the homomorphism g : K[X]— K[a] given 
by gX =a is an isomorphism and so can be uniquely extended to an isomorphism 
go: K(X) — K(q) of the corresponding fraction fields (see F7). 

The converse part of F9 follows for instance from the fact that K[X]—and 
therefore also K(X) —is infinite-dimensional over K. 


Thus the simple transcendental extensions of a given K are all of one type, 
represented by K(X)/K. 


5. This is a good place for one more essential remark about fields. Let K be a 
field. For each n € Z, consider the n-th multiple nx = n1x of the unity lx in K. 
If nx #0 for all n 4 0, we say that K is a field of characteristic zero, and write 


(26) char K = 0. 


If, on the contrary, there is a natural number 7 such that nx = 0, and if p is the 
smallest such number, p is called the characteristic of K and we write 


(27) char K = p. 


Because (mn)x = mgnx, this p must be prime. For the moment, denote by Zx 
the subring of K consisting of all nx, for n € Z. Consider the uniquely defined 
homomorphism 


(28) g:Z—>K _ suchthat gl=I1x. 


Two cases can be distinguished: 


Case A: kerg 4 0. Then we are in situation (27) above. If 7 € ker g, division by p 
with remainder shows that n € pZ. It follows that kerg = pZ, so the Fundamental 
Homomorphism Theorem applied to g yields an isomorphism 


(29) Ze ~Z/pl. 


In particular, Zx has exactly p elements. Being a finite integral domain, Zx is a 
field! (See Chapter 2, Remark after F2.) 

Case B: ker g = 0. This occurs if and only if char K = 0, and the Homomorphism 
Theorem applied to (28) then gives an isomorphism 


(30) Ze XZ. 
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Definition. A field is called a prime field if it has no proper subfields. 
F10. (a) Any field K has exactly one prime field as a subfield. (This is called the 
prime field of K.) 
(b) Any prime field K is isomorphic either to Q or to some Z/pZ for p prime 
(depending on whether char K = 0 or char K = p > 0). 


Proof. (a) The intersection of all subfields of K is a subfield of K. It is the smallest 
subfield of K, hence a prime field. 


(b) Let K be any field and Ko its prime field. Clearly, Zx C Ko. Now, in case A 
above, Zx is itself already a subfield of K, so Ko = Zx ~ Z/pZ. In case B we 
have Ko = FracZx ~ FracZ = Q. 


Remarks. (a) It is customary to write just n instead of nx, and we will do so. But 
you should keep an eye open in each case for whether the n represents an integer 
or an element of K. 


(b) Clearly Q is a prime field (indeed, up to isomorphism, the only prime field of 
characteristic 0). For any prime number p, 
(31) Fp :=Z/pZ 


is a field (see Chapter 2, Remark after F2; naturally, to show that Z/pZ has no 
zero-divisors, it is necessary to use the well-known Euclidean result: if p is a prime 
dividing ab, then p divides a or b; see also Chapter 4). For a given p, the field F, 
is, up to isomorphism, the only prime field of characteristic p. As an example of a 
nonfinite field of characteristic p consider the field of rational functions F, (X’) over 
Fp. 


(c) It’s good to keep in mind the following trivial fact: If K is a subfield of F, then 
char EF = char K. 
6. To conclude this chapter we will go into another interesting characterization of 


simple algebraic field extensions. First we prove: 


F11. Let E/K be a simple algebraic extension with primitive element a. Let L be 
an intermediate field of E/K, and denote by 


g(X) =X" + BX" | +--+ BiX + Bo € L[X] 
the minimal polynomial of a over L. Then 


L = K(Bo, fi, FS. e , Bm-1)- 
Proof. Set F = K(Bo, B1,...,Bm-1). Trivially, F C L. Since g € F[X] we see 
that g is the minimal polynomial of a also over F’. Consequently, 
(32) F(a): F=L(a):L 


(see Chapter 2, F3). But since E = K(a) we get F(a) = L(a) = E, so (32) says 
simply that EF: F = EF: L. By the degree formula this means F': K = L: K, which 
(since F C L) demands that F = L. 
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Theorem 5. Suppose E/K is an algebraic extension. Then E/K is simple if and 
only if it possesses only finitely many intermediate fields. 


Proof. Denote by & be the set of all intermediate fields of E/K. 


(i) Assume E = K(a), and set f = MiPox(a). To prove the finiteness of &, 
consider the set 


= {g € E|X] | g is normalized and divides f in E[X}}. 


Now, it is well known that EX] enjoys unique factorization into prime factors (see 
for example LA II, p. 142, or the next chapter in this book). Therefore f has only 
finitely many normalized factors in E[X], and thus % is finite. Now consider the 
map 


(33) Doe 


that takes each g(X) = X” + Bm; X™~! +---+ Bo in & to the intermediate field 
K(Bo,...,Bm-1). Given L € &, the element g = MiPoz(q) is a factor of f in 
L{[X], therefore also in E[X]. Thus g lies in %. By F11, L is the image of g under 
the map (33). The map (33) is thus surjective, and since & is finite, so is &. 


(ii) The converse will be proved here only in the case where K has infinitely many 
elements. Suppose that & is finite. Then FE = K(a1,...,Q@n) with finitely many ele- 
ments a;; otherwise there would be an infinite chain of intermediate fields obtained 
by adjoining ever more elements. 

Now, to start an induction, we assume that E = K(qa, B). Since & is finite but 
K is infinite, there exist distinct A;,42 € K such that 


K(ia + B) = K(za + B) =: L. 


Then (A;a@ + B)— (Aza + B) = (A; —A2)a lies in L, and therefore so does a, and 
likewise 6. It follows that E = L = K(A,a + B), so that E/K is simple (with 
y =),a+ 6 as a primitive element). To prove the case E = K(a1,...,n), apply 
the induction hypothesis to write K(a1,...,@n-1) = K(a), so E = K(a, ay). 

For K a finite field the assertion follows from the fundamental theorem of the 
theory of finite fields, which we will study later (Theorem 2 in Chapter 9). 
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Fundamentals of Divisibility 


Throughout this chapter, 
R stands for a commutative ring with unity. 


Much of the content of this chapter is probably familiar to you from earlier courses. 
We nonetheless lay it out here because of its fundamental importance; in connection 
with the problems pursued up to now, we will be particularly interested in the 
question of irreducibility of polynomials. 


1. Given elements a,b in R, we say that a is a divisor of b (or divides b, or that b 
is divisible by a) if there exists c in R such that b = ca. In this case we write 


(1) a|b. 


The negation of (1) is denoted by 


(2) atb. 

The divisibility relation satisfies some obvious rules: 
(3) a\a_ (reflexivity); 

(4) a|b and b|c => alc (transitivity); 
(5) l|a, a|0; 

(6) a|b and c|d => acl\bd. 


Item (5) says that | is a minimal and 0 is a maximal element for the divisibility 
relation. Divisibility is compatible with addition in the following sense: 


(7) a|b anda|lc => albt+e. 
If R is an integral domain, 


(8) ac|bec => alb for c 40. 
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Elements of R™, that is, units or invertible elements of R, can be characterized thus: 
(9) ell <== ce R*. 


Definition 1. If a|b and b|a, we say that a is associated to b and write a = b. 
Clearly = is an equivalence relation on R. 


F1. In an integral domain R we have a = b if and only if there is a unit ¢ of R such 
that b = ea. 


The simple proof is left to the reader. 
In the ring Z we have a = b if and only if a = +b. But in general an integral 
domain has more units than just 1 and —1: 


Examples. (a) If R is a integral domain, so is the polynomial ring R[X], and 
R[X/* = R*. 
(b) For the subring R = Z[i] of C we have R* = {+1, +i}. 
(c) For a field K we of course have K* = K ~ {0}. 
(d) For the subring R = Z[V2] of R we have R* = {+(1 + V2)/ | j © Z}. (The 
proof of this is not totally straightforward and is left to the reader as a more 
challenging exercise.) 


Definition 2. Let a,,...,d, be given in R. An element d € R is called a greatest 
common divisor (gcd) of a1,..., dn if the following conditions are satisfied: 

(i) d is acommon divisor of a1,...,@y. 

(ii) Every common divisor of a1,...,@ also divides d. 
We say that the elements a1,...,@, are relatively prime if 1 is a gcd of a1,...,ay. 
The notion of the least common multiple (lcm) of a,,...,@, is defined analogously. 
F2. Any two gcd’s of a1,..., An are associated to one another. Likewise for any two 
lcm’s of a1,...,@n. 


This follows immediately from the definitions. 
But how about the existence of a gcd or lcm for given elements of R? 


2. In investigating divisibility questions it is relevant to consider in connection with 
an element a of R the set of its multiples, i.e., the principal ideal generated by a: 


(a) = Ra={xa|xe R}. 
Clearly, 
(10) alb <=> (6)C(a). 


This translation of the divisibility relation into a simple inclusion relation is very 
fruitful. We have, for example, 


(11) a=b <=> (a)=(b). 
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Moreover: v is a common multiple of a and 5 if and only if (v) C (a) (6). From 
this we deduce easily that 


(12) m isanlemof a,b => (a)N(b) = (mM). 


Thus there exists a lowest common multiple for a, b if and only if the ideal (a) N(b) 
is a principal ideal. 


F3. Let I, and I be ideals of R. Then I, N In and I, + In := {a + @2 | a; € Tj} 
are also ideals of R; and in fact I, N In is the largest ideal of R contained in I; and 
In, and I, + Ip is the smallest ideal of R containing I, and I. 


Of course similar statements hold for the intersection 7; M---M J, and the sum 
I, +-+-+ J, of more than two ideals. In the case of principal ideals we also use 
the notation 


(13) (ay,...,4n) = (41) +--+ + (Gy). 


This set consists of all R-linear combinations of a1,...,dy. 

So ¢ is acommon divisor of a, 6 if and only if (a) + (b) € (t). Now, (12) does 
not have a complete analog for the gcd (why not?); but if (a) + (b) is a principal 
ideal, say (a) + (b) = (d), then d is a gcd of a,b. (We already know that d is a 
common divisor of a and b; if ¢ is another, then (d) C (ft), so t|d.) 


Definition 3. An integral domain R is called a principal ideal domain, or PID, if 
every ideal of R is a principal ideal. 


F4. Ina principal ideal domain R any tuple of elements a,,...,@n of R has a gcd. 
If d is a gcd of a1,..., Qn, it can be represented as 
(14) d=X1a, +--+ + Xndy 


for appropriate x; € R. 


Proof. Given a1,...,@y, we use the assumption that R is a PID to find d such that 
(15) (a1) +++ + (an) = (d). 


This means d is a gcd of the a;, by the argument preceding Definition 3; moreover, 
d clearly has a representation of the desired from. And so does any other gcd d’ of 
the a;, since (d’) = (d), by F2 and (11). 


Thus in a principal ideal domain not only is the existence of a gcd for any 
a1,...,@n assured, but it’s true to boot that any gcd has an additive representation 
of the form (14), which is astonishing. But none of this would help if we could not 
prove the existence of interesting principal ideal domains... 


FS. The ring Z of integers is a principal ideal domain. 
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Proof. Let I be an ideal of Z; we may except the trivial case J = (0). Among all 
nonzero elements of J, let a be one with smallest absolute value |a|. We claim that 
I =(a). Obviously (a) C I. Now let b € J. By considering division with remainder 
we see that there exist g,r € Z such that 


b=qa+r and |r| <|al 


(we can even demand that 0 < r < |a| or alternatively that —S Ia <r< tla ). 
Because r = b — qa, this r is an element of 7. If r were nonzero we'd have a 
contradiction with our choice of a, because |r| < |a|. It follows that b = ga, and 
therefore b € (a). 


A study of this proof leads to the following generalization: 


Definition 4. An integral domain R is called a Euclidean domain if there exists a 
map v: R > NU {0} such that v(0) = 0 and that, for every a,b € R witha ¥ 0, 
there exist g,r € R with 


b=qa+r and v(r)<v(a). 
Such a map v is called a Euclidean valuation on R. 


Examples. (i) R=Z with v(a) = |al. 
(ii) If K is a field, R = K[X] is a Euclidean domain, with valuation v defined by 
setting v(0) = 0 and v(g) = (deg g)+ 1 forg £0. 


F6. Every Euclidean domain R is a principal ideal domain. 


This is proved exactly like the case R = Z of F5. 


3. We now generalize the familiar notion of prime numbers in Z and irreducible 
polynomials in K[X]. 


Definition 5. An element z of R is called irreducible if x € R* and 
(16) z=ab => aé€R* or be R*. 


Remarks. (1) The irreducible elements of Z are precisely the prime numbers p and 
their negatives — p. 

(2) By the Fundamental Theorem of Algebra (see for example LA I, p. 191), we 
know that the only irreducible polynomials in CLX] are the linear polynomials. As an 
exercise, deduce the following: Apart from linear polynomials, the only irreducible 
polynomials in R[X] are those of the form f = aX*+bX +c with b* —4ac <0. 


(3) A divisor a of b is called proper if it is neither a unit of R nor an element 
associated to b. Thus a nonzero element z is irreducible if and only if it is not a 
unit and has no proper divisors (here we’re assuming that R is an integral domain). 
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Definition 6. We say that a €¢ R can be decomposed into irreducible factors if it has 
an expression of the form 


(17) d= ,M2...m, with ¢e € R* and each 7; irreducible. 


(Here we allow r = 0, in which case (17) is to be read as saying that a = el = €). 
An integral domain where every a 4 0 has a decomposition into irreducible factors 
is called a factorization domain. 

We say that a has a unique decomposition into irreducible factors if it has a 
decomposition into irreducible factors and the following uniqueness condition holds: 
If in addition to (17) we have another such decomposition 


(18) Se ee 


then r’ = r and, after a permutation, 2; = 2; for 1 <i <r. An integral domain 
where every a 4 0 has a unique decomposition into irreducible factors is called a 
unique factorization domain (UFD). 


F7. For a factorization domain R, the following conditions are equivalent: 
(i) R is aunique factorization domain. 


(ii) For any irreducible element x of R we have 
(19) wtlab > anlaora|b. 


Proof. (i) => (ii): We may as well assume a,b 4 0. Given factorizations a = 
em, ...my and b = &7, ... is of a and b into irreducible factors we get for ab the 
factorization ab = cém, ...1,7,...i5. Now, if 2 | ab, there is a decomposition of 
ab into irreducible factors where 2 appears. From the assumption it follows that 
zr is associated with one of the elements 71,...,7;, 71,..., 7s. Therefore z is a 
divisor of a or b. 


(ii) => (i): Assume that (17) and (18) are true and that r > 1, the case r = 0 
being trivial. Now, zr, is always a divisor of the product on the right-hand side of 
(18). Assumption (ii) then implies that 2; must divide one of the 7; —let’s say 
a. Then there is a unit 7 such that 2} = nz. By cancellation of 2; we then get 
€I2 ... 1, = €'nw,...7,/. The assertion follows by induction. 


Definition 7. An element z in R is called prime in R, or a prime of R, if it is not 
a unit and it satisfies (19). 


Remarks. (1) Clearly, in an integral domain every nonzero prime is irreducible. 


(2) F7 suggests a question: Under what circumstances are the irreducible elements 
of an integral domain necessarily prime? That this is not always the case can be 
seen from the example of R= Z[V—5], where 2 is irreducible but not prime. (Prove 
this as an exercise; notice that 6 = 2-3 = (1 + /—5)(1— /-—5) in R.) 


F8. An integral domain R is a unique factorization domain if and only if the follow- 
ing two conditions are satisfied: 
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(i) Every chain (a1) © (az) © +++ © (Gn) © (Gn41) © «++: of principal ideals is 
stationary, that is, (aj) = (ay) for some n and all j = n. (This is the ascending 
chain condition for principal ideals.) 


(ii) Every irreducible element of R is prime. 


Proof. We show first that (i) implies that any nonzero a in R can be decomposed 
into irreducible factors. Let M be the set of all ideals (a) £0 such that a has no such 
decomposition, and assume that M #@. Then M has a maximal element; otherwise 
there would be a nonstationary chain (a1) $ (a2) S++ S (dn) S (an) S (Gn41) g tee, 
in contradiction with assumption (i). So let (a) be maximal in M. The generator 
a can be neither irreducible nor a unit. Thus a = bc with (a) G (b) and (a) g (c). 
Because (a) is maximal, both 6 and c have decompositions into irreducible factors. 
But then the same is true of a = bc, contradicting the assumption that (a) « M. 

If condition (ii) is satisfied as well as (i), we see from F7 that R is a UFD. 

Conversely, assume that R is a UFD. Then (ii) is immediately true, by F7. Let a 
and t be elements of R with (0) ¥ (a) C (¢) ¥ (a), and suppose that a satisfies (17). 
From the uniqueness of the decomposition into irreducible factors we conclude that, 
since ¢ is a proper divisor of a, it has (after reordering the 2;) a decomposition of 
the form ¢ = e’11... 75, with s <r. From this one easily concludes (i). 


F9. Every principal ideal domain is a unique factorization domain. 


Proof. Let R be a PID. We will use the characterization of UFDs in F8. Consider 
a chain (a1) © (a2) €--- of principal ideals. Let J be the union of all the (a;). 
It’s easy to check that J is an ideal of R. By assumption, it is a principal ideal, 
I = (a). By the definition of J, there exists n such that a € (ay). Then (aj) CI = 
(a) © (an) € (a;) for every 7 = n, so the chain is stationary. 

Now let z be any irreducible element of R, and let a be an element of R not 
divisible by z. Since z is irreducible, z and a are relatively prime. By F4 we have 
1=xz + ya, for appropriate x, y in R. Multiplying by an arbitrary b € R we get 


(20) b = (xb)x + y(ab). 


This says that if a divides ab, it divides b. Therefore z is prime. 


Remarks. (1) The converse of F9 is not true. For example, the polynomial ring 
Z[X] over Z is a unique factorization domain, by a theorem of Gauss (see next 
chapter), but it is not a principal ideal domain (again see next chapter; but this is 
easy to see directly — for instance, the ideal (2) + (XY) cannot be principal in Z[X’].) 


(2) Euclidean domains are principal ideal domains (F6), and principal ideal domains 
are unique factorization domains. In particular, Z is a UFD (Euclid, ca. —330), and 
so is any polynomial ring K[X] over a field K (Stevin, 1585). In a Euclidean 
domain, though, there are additional benefits arising from the Euclidean valuation 
v. For example, a gcd of two given elements a # 0 and 5 can be computed step- 


Unique factorization domains 39 


by-step through the Euclidean algorithm: 


b =qa+nr with v(r1) < v(a), 
a =qtith with v(r2) < v(r1), 


Mi-1 = iti +rit1 — with v(ri41) < vi), 


Mm—-1 = Gnt'n + 0. 


Then r, is a gcd of a,b and these equations even provide, by recursion, elements 
x,y such that r, = xa+ yb. In the case R = K[X], with v as in Example (ii) 
following Definition 4, the elements g;,7; are uniquely determined by a,b. The 
same is true for R = Z, if we demand that each r; be nonnegative. 


4. We now wish to inspect more closely the situation in unique factorization do- 
mains. By taking a factorization of the form (17) and grouping together irreducible 
factors that are associated to one another, we arrive at a representation of the form 


(21) a=nn,'m,”...1e" with n € R*, e EN, 


where z; is not associated to a; if i A j. If the ring in question is a UFD, this 


representation is essentially unique; indeed, if besides (21) there were another such 


decomposition a = ep! : pear we would have m = n, and (after renumbering) 


Oj = 7m; and e; = f; for alli = 1,2,...,m. In this sense (21) is called the prime 
factorization of a. 

It turns out to be useful to extend our terminology a little in a formal sense. 
Toward this goal we first fix a directory of primes P of R specifying a representative 
for each class of associated primes; that is, ? is a set of nonzero prime elements of 
R such that every nonzero prime of R is associated with one and only one element 
of %. (Such a ¥ exists by the axiom of choice.) In many cases there is a canonical 
choice for % — for example, in R = Z the set of natural prime numbers stands out, 
and in the polynomial ring K[X] over a field K we can take for # the set of all 
normalized prime polynomials. In any case we have: 


F10. Let R be a unique factorization domain and P a directory of primes of R. 
Every nonzero a € R possesses a unique representation of the form 


(22) a=eé I] me, 


where € is a unit of R and the eg are nonnegative integers with e; = 0 for almost 
all x € (that is, all but finitely many m € P). 


There is also a sort of converse to this statement: 


F11. Let R be a integral domain and P a subset of R ~ {0}. If every nonzeroaé R 
can be uniquely represented in the form (22) above, then R is a unique factorization 
domain and # is a directory of primes of R. 
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Proof. Each x € # is of course irreducible, by assumption. Now let z be any 
irreducible element of R. Again from the assumption we have x = ex’, with ea 
unit and x’ € P, both being uniquely determined. Thus z is associated to exactly one 
x’ €P. Overall, the assumption implies that every a 4 0 has a unique decomposition 
into irreducible factors — that is, R is a UFD. Since in a UFD being an irreducible 
element is the same as being a nonzero prime, the proof is complete. 


Let R be a unique factorization domain and x any irreducible element of R. 
For every nonzero a € R we denote by wz (a) the (highest) exponent with which z 
appears in a. Thus we have a unique representation of the form 


(23) a=n""q! with rta’. 

We also set wz(0) = oo. Thus we obtain a map 

(24) W_: R>ZU {oo}, 

which obviously enjoys the following properties: 

(25) Wx (ab) = wz (a) + Wr (d), 
(26) Wr(at+b)= min(wz (a), Wx (b)). 


It should be stressed that both the definition of w, and property (25) depend on the 
assumption that R is a UFD. 
If K is the fraction field of R, we can extend wz, to a map 


(27) Wr: K > ZU {ov}, 
by setting 
(28) Wr (a/b) = Wr (a) — Wz (d). 


Because of (25), this wz is well defined. Moreover now (25) and (26) hold for all 
a,b € K. We call wz the 2-adic valuation on K. The ability to extend arithmetic 
considerations to fraction fields has certain advantages. 


F12. Let R be a unique factorization domain with fraction field K = Frac R and fix 
a directory of primes P. 


(i) Every nonzero element x € K has a representation 


(29) x= I] nr) with e€ R™, 
mEPY 
where Wx (x) = 0 for almost all x € Pf. 
(ii) An element x € K is in R if and only if wz(x) = O forall a €. 
(iii) For a,b € R we have a|b if and only if wz (a) < wz (b) forall x EF. 
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(iv) Given arbitrary a1,...,4n € R, there exist a gcd and an Icm for a,,...,Qn, 
and in fact 
d = I] grmin(wx (41),....W2 (an)) isa gcd 
rep 
and 
m= I] gp Max(Wr (41),---W (an) is an lem 
EP 
Of a1,...,4n (where 1@ is to be understood as 0 if it occurs). 


Proof. Part (i) follows easily from F10 with the help of (28). If wz(x) = 0 for all 
z, then x € R, by (29). The converse is clear, so (ii) is established. Since a|b is 
equivalent to b/a € R, part (iii) follows using (28). Part (iv) now is an automatic 
consequence of (iii). 


5. The foregoing sections have dealt with little more than the general foundations 
of elementary arithmetic. We now wish to introduce some ring-theoretical concepts 
connected with our discussion in Section 3.2. 


Definition 8. Let R be a (not necessarily commutative) ring with unity 1 4 0. We 
call R simple if every homomorphism R — R’ into an arbitrary ring R’ is either 
injective or the zero map. Clearly (see Section 3.2) a ring R (with 1 4 0) is simple 
if and only if {0} and R are the only ideals of R. An ideal J ~ R of R is a maximal 
ideal of R if there is no ideal of R distinct from J and R and containing J. 


F13. I is a maximal ideal of R if and only if the quotient ring R/TI is simple. 


Proof. Ideals of R containing J are in one-to-one correspondence, via the quotient 
map 7: R— R/T, with ideals of R/J. 


Definition 9. Let R be a commutative ring with unity. An ideal J of R is called 
a prime ideal of R if R/TI is an integral domain. This condition is equivalent to 
saying that J ~ R and 


(30) abel = aélorbel. 


Thus a principal ideal (x) of R is prime if and only if z is a prime element of R. 


F14. Let R be a commutative ring with unity. R is simple if and only if R is a field. 
Therefore an ideal of R is maximal if and only if R/I is a field. Moreover, every 
maximal ideal of R is also a prime ideal. 


Proof. Only the first assertion needs to be proved. Let R be a field and g: R= R’ 
a ring homomorphism. If the kernel of g contains a nonzero element a, it contains 
every element x of R, because x = (xa7!)a; thus @ is the zero map. Conversely, 
assume R is simple and take a nonzero a € R. Then (a) = R, so there exists x € R 
such that ax = 1. 


We single out a special case: 
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F15. [f E, F are fields and g : E — F is a homomorphism of rings with unity 
(meaning that plz = |F), then @ is injective, and so provides an isomorphism 
between E and a subfield E' of F. 


Definition 10. Let R be a commutative ring with unity. Two ideals 1, 7, of R are 
relatively prime if I; + Iz = R; in other words, when there exists a € J; and b € I, 
such that a+ 6 = 1. The product I, I, of two ideals I,, I of R is the ideal of R 
generated by all products xy, where x € J; and y € J; thus is consists of all finite 
sums of such products. Clearly J) 7, € 1) N12. 


Lemma. (a) For I, [2 relatively prime ideals of R we have 1, I, = 1, 11. 
(b) [fan ideal 1, of R is relatively prime to each of the ideals In, 13,..., In of R, 
it is also relatively prime to the product I71I3...In. 
Proof. (a) From 1 = a+b with a € J; and b € Jy we conclude by multiplying with 
an arbitrary c € J; NJ, thatc =ca+cbe TI. 


(b) By assumption there exists for each i = 2,3,...,m an element a; € J; anda 
b; € I; such that 1 = a; + );. It follows that 


1=[[@th)ehthh...In 


I 


F16 (Chinese Remainder Theorem). Let [,, I2,..., In be pairwise relatively prime 
ideals of a commutative ring R with unity. The natural ring homomorphism 


(31) R- R/I, x R/Inx---x R/In 

is surjective, that is, given any elements X1,X2,...,Xn of R there exists x € R such 
that 

(32) x=x;mod/J; fori =1,2,...,n. 


The kernel of the map (31) is the ideal 
(33) LHADN:::Al, = In... In, 
so the element x in (32) is uniquely determined modulo the ideal (33). 


Proof. Consider first the case n = 2. By assumption there exist e; € J; and e2 € In 
such that e; + €2 = 1. For arbitrary x1, x2 € R, the element 


xX=XxX2€;+X1e2 


is then a solution of the system (32). Now let n > 2 be arbitrary. By induction we 
can assume that there exists x’ € R such that 


x’ =x; mod J; fori =2,...,n. 
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By part (b) of the preceding lemma, J; is relatively prime to the product [71/3 ... In; 
thus, thanks to the previously settled case n = 2, there exists x € R such that 


x =x; mod J; and x=x' mod Ih... In. 


Then x clearly satisfies all the congruences in (32). As for the kernel of (31), it 
obviously equals the intersection of the J;. But by the preceding lemma one easily 
concludes by induction that 


HADN:::Al, = yN(d2...In) = [11n...In. 


> 


Prime Factorization in Polynomial Rings. 
Gauss’s Theorem 


1. Let a € C be an algebraic number. When looking into whether o is constructible 
from {0,1} with ruler and compass, we were led to investigate in particular the 
degree of the field extension @(a)/Q@ (Chapter 1, F9). Now, Q@(a@):Q@ equals the 
degree of the minimal polynomial of w over K (Chapter 2, F3). Thus, if we already 
know a (normalized) polynomial f € Q[X] such that f(a) = 0, our task is to 
determine whether /f is irreducible. If so, we have found our desired minimal 
polynomial — itis f (Chapter 3, F6). If not, we must continue the search by looking 
for irreducible factors of /. 


Example. Consider a = e?7"/", for n > 1 a natural number. Since 
4k 0 eg NG ara ee GaP 


the polynomial f(X) = X¥""!+---+ ¥ +1 satisfies f(a) =0. Is f irreducible? If 
n has a proper divisor d, surely not, since in this case (X% — 1)/(X — 1) is a divisor 
of f. But if 7 is prime we will see in F11 that f is indeed irreducible. 


This is not the place for a comprehensive study of the problem mentioned in the 
first paragraph. Nonetheless, we would like to shed some light on certain theoretical 
aspects of the issue. Two very natural questions will guide us: (i) Is the ring Z[X] a 
UFD? (ii) Is every polynomial f € Z[X] that is irreducible in Z[X] also irreducible 
in Q[X]? These questions are intimately connected and were both answered by 
Gauss in the affirmative. 


2. In the remainder of this chapter, 
R will always be an integral domain. 


Question (i) above can be generalized to read: When is R[X] a UFD? For practice 
with basic algebraic constructs, we start by establishing the following: 


F1. R[X] is a principal ideal domain if and only if R is a field. 


Proof. If R= K is a field, K[X] is a Euclidean domain and therefore a principal 
ideal domain (Chapter 4, F6). 
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Conversely, assume R[X’] is a principal ideal domain. Consider the kernel J of the 
substitution homomorphism g : R[X] > R with y(X) = 0. Since @ sends each 
polynomial in R[X’] to its constant term, J = (XY). Thus 


(1) R[X]/X = R[X/I~R 


is an integral domain, that is, X is prime in R[X], by Definition 9 of Chapter 4. 
But then the next statement shows that R[X]/X is actually a field, and therefore by 
(1) R is also a field. 


F2. If A is a principal ideal domain and x is irreducible in A, the quotient A/x is a 
field. 


Proof. According to F14 in Chapter 4, we have to show that (zr) is a maximal ideal 
of A. Let I be a proper ideal of A containing (7). By assumption, J = (a) for 
some a. Then a|z anda ¢ A”, so since z is irreducible we have (7) = (a) = I. 
(Incidentally this shows again that every irreducible element of a principal ideal 
domain is prime.) 


F3. If R[X] is a UFD, so is R. 
Proof. Take a nonzero a € R. Since R[X] is a UFD, we can write 
a= ep\(X)p2(X)... pr(X) 


with e © R[X]* = R%* and pi(X),..., pr(X) all prime in R[X]. But then all the 
factors have degree zero, so define 2; := p;(X) € R. For elements of R, divisibility 
in R[X] coincides with divisibility in R, so the 2; are prime in R. Thus every 
nonzero a € R can be represented as 


(2) d= €N1N2...1,y 


with ¢ € R* and the z; prime in R. If a € R is irreducible, we must have r = | in 
this representation, so a is associated to mz; and therefore also prime. This shows 
that R is a UFD, by F7 in Chapter 4. 


3. We now prove the converse: 
Gauss’s Theorem. /f R is a UFD, so is the polynomial ring R(X]. 


We begin with some preliminary observations, which are of interest in and of 
themselves. Every ring homomorphism g : R > R’ can be naturally extended to 
a homomorphism R[X] — R’LX] between the corresponding polynomial rings, by 
setting 


(3) yoax' 6 Yi g@x': 


we will denote the extension by gy as well. (By the way, if we regard R’[X] as an 
R-algebra via g, the map (3) can be thought of as a substitution homomorphism.) 
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Given a € R, we consider in particular the quotient map R — R/a and its natural 
extension 


(4) R[X] > (R/a)[X]. 
F4. (i) The homomorphism (A) yields a natural isomorphism (of R-algebras) 
R[X]/a > (R/a)[X]. 
(ii) An element a € R is prime in R if and only if it is prime in R[X]. 


Proof. Part (i) follows from the Fundamental Homomorphism Theorem, since the 
kernel of (4) is clearly J = aR[X]. Part (ii): since R[X]/a ~ (R/a)[X], we have a 
prime in R <> R/a is an integral domain <> (R/a)[X] is an integral domain 
<=> R[X]/a is an integral domain <= a is prime in R[X]. 


From now on we assume R is a unique factorization domain. We denote by 
K=FracR 


the fraction field of R. Now let z be a given nonzero prime of R. The corresponding 
wt-adic valuation Wz: K — ZU {oo} in R (page 40) can be extended to a map 


(5) Wa: K[X] > ZU {00} 
as follows: set 
(6) wx( >a: X') = min {wz (ai ) | i= O}. 


Thus, for f € R[X], the value of wz(/) is the exponent of the highest 2-power 
that fits in all coefficients of f. By equation (25) in Chapter 4, 


(7) Wrlcf)=wWrlc)+uwr(f) force K, f € K[X]. 
The springboard for the proof of Gauss’s Theorem is provided by the next result: 


F5. Let R be a unique factorization domain and xn # 0 a prime in R. With the 
notations introduced above, we have, for all g,h € K[X], 


(8) Wx (gh) = Wr(g) + wr (A). 
Proof. Clearly, for every f € K[X] there exists c € R such that cf € R[X]. So 
taking (7) into account, we can assume without loss of generality that g, € R[X]. 
For simplicity we set w = wz. By the definition of w we then have g = 1”) gy 
and h = 1¥h,, where g1,h, € R[X] are polynomials satisfying 
(9) w(gi)=0, whi) =0. 
We obtain gh = 1¥8)+¥™) ¢h,, which together with (7) implies w(gh) = w(g)+ 
w(h) + w(g1/1). Thus we must show that 

w(gihi) =0. 


Suppose to the contrary that w(gih1) > 0, that is, 7|g1/41. By F4@i) we then have 
|g, or w|hy, that is, w(g1) > 0 or w(A1) > 0. But this contradicts (9). 
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Definition. A nonconstant polynomial f € R[X] (that is, one whose degree is at 
least 1) is called primitive if the gcd of the coefficients of f is 1. 


Thus a normalized polynomial in R[X’] is trivially primitive. If R is a UFD, 
every nonconstant polynomial g € R[X] can be represented as 


g=agi, with ae R~ {0} and g; € R[X] primitive. 


Also, a is determined up to associatedness, being the gcd of the coefficients of g. 
We call a the content of g; more precisely, the uniquely determined principal ideal 
(a) is called that. As can be proved easily from F5, 


(10) Content(gh) = Content(g) - Content(/). 


We now formulate Gauss’s result a bit more precisely: 


Theorem 1 (Gauss). Let R be a UFD with fraction field K. Let P, be a directory 
of primes for R and Py, a directory of primes for K[X] containing only primitive 
polynomials of R[|X]. Then R[X] is a UFD and P; U P» is a directory of primes for 
RX]. 


(It is clear that there exists a #2 with the required properties.) 


Proof. Take any nonzero g € R[X]. Since K[X] is a UFD, there is a unique 
factorization 


(11) g=a I] f% with ae K* = K[XJ” and integers e¢ > 0, 
fEP2 


where er = 0 for almost all f € 9). Now, for any a € P; we have (see F5) 
Wr(g) = Wr(a) + Derwa(f) = wx(a), the latter equation because the f’s are 
primitive. It follows that w,(a) = 0 for all z, and thus also a € R (see F12 in 
Chapter 4). Now let 


(12) a=e|| x 


WEP] 


be the prime factorization of a in R. Together, (11) and (12) yield 


(13) g=el|[x«* [| s?. 


TEP] fEP2 


This representation is unique, that is, ¢, the e, and the ey are uniquely determined 
by g. For if a representation of the form (13) is given, a comparison with (11) 
immediately yields (12), since K[X] is a UFD; but now since R too is a UFD, the 
representation (13) is completely fixed. Now keeping in mind Chapter 4, F11, the 
proof is complete. 


F6. Let R be a UFD with K = Frac R, and let g € R[X] be nonconstant. If g is 
irreducible in R[X], it is irreducible in KX]. 
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Proof. By Theorem 1 we have (with #2 as in the theorem’s statement) 


g=ef with e€ R* and f €Do. 


Thus g is irreducible in K[X]. 


Conversely, if g € R[X] is irreducible in K[X], then g is irreducible in RLX] if 
and only if g is primitive. 


F7 (Gauss’s Lemma). Let R be a UFD and K = Frac R. If f(X) € R[X] can be 
expressed as 


S(X) = g(X)A(X) with normalized g,h € K[X], 
all the coefficients of g and h lie in R. 
Proof. For any prime 1 4 0 of R we have 
Wr(f)=0 since f € R[X], 


Wx(Z), Wr (h) < wez(1) =0 since g,h are normalized. 


But wz(f) = wa(g) + wz (h), so all three integers vanish. Since a was arbitrary, 
all the coefficients of g and h belong to R (see Chapter 4, F12). 


F8. Let R be a UFD and K = Frac R. Let f € R[X] be a normalized polynomial 
with coefficients in R. Then any root a of f that lies in K actually lies in R, and 
moreover divides the constant term of f. 


Proof. Take a factorization f(X) = (¥ —a@)h(X) in K[X]. Since f is normalized, 
so is h. By Gauss’s Lemma (F7), all the coefficients of X¥ —@ and h(X) are in R. 
In particular, a € R, and since do := f(0) = (—@)h(0) we have a | do. 


Application. Consider the special case R = Z, K = Q. We prove that f(X) = 
X?3 —2 is irreducible. Suppose f were reducible. Since it has degree 3, it would 
have a linear factor, and therefore a root aw in Q. By F8, X 3 —2 would also have a 
root in Z, which is clearly not the case. 

From the irreducibility of ¥? — 2 it follows that </2 is not a rational number. 
Using similar arguments one can easily derive from F8 the irrationality of numbers 
such as VJ3, “7D, Dt. (Note that although 27 18 irrational, ¥!* — 27 is not 
irreducible. As an exercise prove that X° — 72 is irreducible— compare F11(d) in 
Chapter 2.) 


4. The next statement suggests a fundamental principle by means of which one can 
investigate the irreducibility of polynomials: 


F9. Let R be an integral domain and let a +> @ be a homomorphism of R into an 
integral domain R; extend this to a homomorphism R[X|— R[X] of polynomial 
rings in the usual way: 


f= la X's f= ya 
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Let f(X) = anX" + +++ + do be a primitive polynomial of R[X] with an # 0. If f 
is irreducible in R|X], then f is irreducible in R[X]. 


Proof. Assume the opposite. Then f = gh, where g, / are nonconstant polynomials 
in R[X] (nonconstant because f is primitive). Taking the image in R[X] gives 
f =ih. Since Gy is nonzero, we must have deg J = deg g > 1 and degh=degh>1. 
Since R is an integral domain, we get a contradiction with the assumption that ve 
is irreducible. 


In applying F9, one is usually dealing with a unique factorization domain R, 
because apart from the fact that otherwise one has hardly any control over the 
primitivity of f, it is also not permissible in the general case to deduce that f is 
irreducible over K = Frac R. On the other hand, the train of thought that leads to 
F9 can be useful even if we don’t know ahead of time that ra is irreducible, but 
rather we know something about the possible factorizations of ta in R[X]. Here is 
the best known application of this approach: 


F10 (Eisenstein irreducibility criterion). Let R be an integral domain, and let 
fC) =aX" + ag X?"™ ++---+ta,;X +ao 


be a primitive polynomial in R[X]. If there exists a prime x of R such that (i) r{a, 
(ii) x | a; for 0 <i <n—1, and (iii) x* tao, then f is irreducible in R[X]. If R is a 
UFD, f is also irreducible in K{X], where K is the fraction field of R. 
Proof. Since 1 was assumed prime in R, the quotient R := R/z is an integral 
domain. We now work as in the proof of F9. Suppose / has a nontrivial factorization 
in R[X]: 

f=gh, with r=degg>1 and s=degh>1. 


Taking the image in R[X] leads to, as before, 
f=Zh, with degg=r and degh=s. 


To be sure, vA is not irreducible, but because of condition (ii) it has the form of = 
aX", so that in R[X] the equation 


(14) aX" = zh 


holds. Set k = Frac R. Since k[X] is a UFD and @ ¥ 0, it follows from (14) that 
Z and h have the form g = BX" and h = yX°, with B,y € k. But r,s > 1, so 
in particular Z(0) = (0) = 0, that is, 7| g(0) and 2|h(0). This implies that 7? 
divides g(0)A(0) = f(0) = ao, contradicting condition (iii). 


Here is an immediate consequence of Eisenstein’s irreducibility criterion: All 
polynomials of the form 


X"—a, with ae Z~ {1,—1} square-free 


are irreducible in Z[X] and therefore also in Q[X]. (An integer a € Z is called 
square-free if it is not divisible by the square of any prime.) 
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F11. /f p is a prime number, the polynomial 
Fy(X)= XP 14 XP 74-4 X41 
is irreducible in Q[X]. 


Proof. We show that Eisenstein’s criterion can be applied after a simple change of 
variables. Instead of F(X), consider the polynomial 


f(X) = F(X +1). 


This / is irreducible in Q[X] if and only if F, is, because the two polynomials are 
mapped to one another by the automorphism of Q[X] coming from the invertible 
substitution X + X¥ +1. Since F,(X)(X — 1) = X¥? —1 we get f(X)X = 
(X + 1)? —1, that is, 


P 
{Qa (Ae =p+(B)xtea( ? )xr tere, 


k=1 


Relative to the prime p of Z, this is an Eisenstein polynomial of Z[X]— that is, it 
satisfies all the conditions required for the application of the Eisenstein criterion, as 
a result of the fact that p divides 


—l)...(p—k 
re 


for 0 <k < p. (This is seen as follows: For 0 < k < p, the numerator, but not the 
denominator, of the fraction in (15) is divisible by p; since we already known — on 
combinatorial grounds, for instance — that (2) is an integer, the divisibility claim is 
proved.) 


F11 has immediate repercussions for the constructibility of regular polygons: 


F12. Let p be a prime. A construction of the regular p-gon with ruler and compass 
is impossible unless p — 1 is a power of 2. 


Proof. If ¢:= e?7'/? lies in AQ, the degree Q(¢) : Q is a power of 2, by Chapter 1, 
F9. But by F11 we always have Q(¢):Q = p—1 (review Chapter 2, F3 and Chapter 
3, F6). 


Thus a regular heptagon (7-gon) is not constructible with ruler and compass, nor 
is an 11-gon, a 13-gon, a 14-gon, a 19-gon, and so on. And neither is a 9-gon, as 
can be seen from the following generalization of F11: 


F13. We keep the notations of F11. Ifn = p” is a prime power, the minimal polyno- 
mial of €:= e?7'/" over Q is 


6) OO S1 bP 4 PP 4 OP = ), 


In particular, Q(¢):Q = p’—!(p—1). 
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Proof, Since X?" —1 = xe — 1)Fpr (X), our € is certainly a root of Fr. It 
suffices to show that the polynomial f(X) := Fpr (X + 1) in Z[X] is an Eisenstein 
polynomial with respect to p. We work in Z[X] modulo p. As we saw in the proof 
of Fll, F,(X +1) = X?7! mod p. Thus 


r—1 


f(X) = For (X $1) = Fo((X +1)?" ") = Fy (XP! $1) = (XP)?! mod p. 


In addition, f(0) = Fpr (1) = p and / is normalized, so f is indeed an Eisenstein 
polynomial with respect to p. 


Until now we have left open the question whether for primes of the form p = 
2” + | it is indeed always possible to subdivide the circle into p parts with ruler and 
compass. If this is so, amoment’s thought shows that the constructibility problem for 
arbitrary n has been completely solved (apart of course from the number-theoretical 
question of which primes have the form 2” + 1). It turns out that the division of 
the circle into p parts with ruler and compass is possible for all primes of the form 
2 +-1, but we will only substantiate this fact after we have developed our conceptual 
machinery some more and acquired more powerful tools. (The reader interested in 
learning more about Gauss’s more direct approach to this problem should consult his 
Disquisitiones Arithmeticae.) Incidentally, Gauss at the age of eighteen had already 
discovered a way to construct a regular 17-gon, before he had a thorough proof of 
the impossibility of constructing, say, a regular heptagon. 

We now address the problem of trisecting an angle: 


F14. Consider an angle y with 0 < g < 2m. If e!® is transcendental, g cannot be 
trisected with ruler and compass. 


Remarks. (1) The condition in F14 is satisfied for uncountably many g. Indeed, 
the function yg + e!® provides a bijection between the interval [0,27:) and the unit 
circle in C; therefore there can be only countably many ¢ € [0, 277) for which e!? 
is algebraic. 


(2) Even when e’® is algebraic, the trisection of g is by no means necessarily 
possible. Consider for example g = 27/3. Trisecting y amounts to constructing a 
9-gon, which as we know from F13 is impossible. Actually it is also easy to prove 
directly that Q(e??/3): Q(e’?) = 3. 

(3) Again in connection with the transcendence condition in F14, the famous 
Hermite—Lindemann Theorem says that if z is any nonzero algebraic complex num- 
ber, e7 is transcendental — in particular, e’? is transcendental for any algebraic value 
of the angle g. Since e’” = —1, the theorem also implies that z is transcendental. 
We will give a proof of the Hermite—Lindemann Theorem in Chapter 17. 


Proof of F14. Let K = Q(e!®), and suppose t = e’® is transcendental. We must 
show that z = e!?/3 does not belong to AK. This will be done if we prove that 


(17) K(z):K =3 


Irreducibility criteria and applications 53 


(see Chapter 1, F9). Now, z is certainly a root of the polynomial X?— 1 over K. 
To prove (17) therefore we have to show that 


(18) X?—t is irreducible in K[X’. 


Since ¢ is transcendental, K = Q(f) is the field of rational functions in ¢ over Q 
(Section 3.4), which is to say, the fraction field of the polynomial ring R := Q[t]. 
Since R is a UFD and ¢ is prime in R, an application of the Eisenstein criterion to 
the polynomial X? —1 proves (18). 


6 


Polynomial Splitting Fields 


1. Still bearing in mind the initial problem of Chapter 1, our task now consists in 
the study of (finite) field extensions E/K. One fundamental question concerns the 
possible intermediate fields of E'/K (see Chapter 1, Theorem 1’). Our subsequent 
discussion will benefit from the introduction of a convenient shorthand: 


Definition 1. Let E; and E> be extensions of a field K, which we regard as algebras 
over K. A homomorphism of K-algebras o : FE; — EF is called a K-homomorphism 
from E, to Ez. We also say that o is a homomorphism from the extension E,/K to 
the extension E,/K and write 


(1) o:E,/K > E2/K. 
If o is an isomorphism, we say that the extensions F,/K and E',/K are isomorphic. 


Remark. A field homomorphism o : E; — E2 satisfies o(1) = 1 by definition; 
therefore it is always injective and so gives rise to an isomorphism of E; with a 
subfield of Ey. If £; and E> are both extensions of a field K, a field homomorphism 
o is a K-homomorphism if and only if 


(2) o(c)=c_ forallce K. 


From Definition | there is a steep but well-traveled path to Galois theory (opened 
largely by Dedekind and E. Artin; see the latter’s Galois Theory). Here we will take 
the more leisurely and scenic route. The following result is simple but far-reaching: 


F1. Let E/K and E'/K' be field extensions and o : K — K' a field homomorphism. 
There is a natural extension of o to a ring homomorphism K|X|— K'[X]; we still 
call it 0, but for f € K[X] we often write f° instead of o(f). Let f € K[X]. 
(a) Every homomorphism t : E — E' extending o maps any root of f in E toa 
root of f° in E’. 
(b) Assume o : K — K' is an isomorphism. Let a be a root of f in E and a’ a root 
of f° in E’. If f is irreducible over K, there is an isomorphism t : K(a) > 
K'(a’) extending o and such that t(a) = a’. 
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Remark. As an important special case, take K’ = K and o = idx. Then a Tt as 
in part (a) is a K-homomorphism from EF to E’, and a 7 as in part (b) is a K- 
homomorphism from K(q@) onto K(a’). By the way, the irreducibility assumption 
in (b) cannot be dispensed with. 


Proof. (a) Suppose f(X) = do + a1 X +--+ +a,X". If 0 = f(a) = ao + aia + 
-+++ ana” then 0 = t(f(a@)) = o(ao) + o(a1)t(@) +--+ +0 (an) t(a)” = f° (ta). 


(b) Let f € K[X] be irreducible; we may as well assume it normalized. Then / is 
the minimal polynomial of w over K. We define t by setting 


t(g(a)) = g°(a@’) for g € K[X]. 
Is t well-defined? If g1(a) = g2(a) we get (g1 —g2)(a) = 0, so g1 -g2 =Af with 


he K[X]; then gf —g5 =h° f° and therefore gf (a’)—g5 (a’) =h? (a’) f? (a’) = 0. 
It is clear that t is a surjective homomorphism from K(q) to K’(a’) extending o. 


In order to have some room to maneuver, we quote now a result whose proof — 
in spite of the statement’s spartan simplicity — requires further preliminaries and is 
postponed to the end of the chapter. The general construction principles laid down 
in preparation for the proof (Sections 6.2 and 6.3) will also be important in other 
contexts. 


Theorem 1. Let (E;)icz be an arbitrary family of extensions E; of a field K. There 
exists an extension E of K and homomorphisms 1; : E;/K — E/K such that E is 
obtained from K by adjoining the union of the sets t; E;, fori € 1. 


Definition 2. A field C is algebraically closed if every nonconstant polynomial 
F(X) € C[X] has a root in C. 


Remark. The field C of complex number is algebraic closed; this is proved through 
analysis, function theory or algebra (see Volume II for the latter). 


F2. The following statements about a field C are equivalent: 
(i) C is algebraically closed. 
(ii) Every irreducible polynomial in C[X] is linear (that is, of degree 1). 


(iii) Every nonconstant polynomial in C[X] is completely decomposable into linear 
factors. 


(iv) If E/C is an algebraic field extension, E = C. 


Proof. (i) => (ii): Let f € C[X] be irreducible. By (i) there exists a € C with 
f(a) =0. Then f is divisible in CLX] by X —a, that is, f = y(X —a), necessarily 
with y E€C%. 

(ii) > (iii): By (ii), only linear polynomials can appear in the prime factorization 
of f €C[X]. 

(iii) > (iv): Let E/C be algebraic. The minimal polynomial f = MiPoc(q) of 
any a € E is irreducible, hence linear, by (iii). Therefore a € C. 


The algebraic closure of a field 57 


(iv) => (i): Suppose f € CLX] is nonconstant. By Chapter 3, Theorem 4, there is 
an extension E/C and ana € E such that f(a) = 0. But C(a@)/C is algebraic, so 
by (iv) we have C(a) = C, that is, a eC. 


Theorem 2 (Steinitz). Let K be a field. 


(1) There exists an extension C of K with the following properties: 
(i) C is algebraically closed. 
(ii) C/K is algebraic. 
Such a field is called an algebraic closure of K. 
(ID) If C; and Cy are algebraic closures of K, the extensions C,/K and C/K are 
isomorphic. 


Proof of part (1). Let K[Xn, 1 € N] be the polynomial ring in countably many inde- 
terminates X,, X2,... over K. Consider the set J of all subsets M C K[X;,, ne N] 
such that 


M isa maximal ideal of K[X,..., Xm] for some m EN. 


For each such M, let Eyg:= K[X1,..., Xm]/M be the corresponding quotient field. 
We regard Fy as an extension of K. Now apply Theorem | to the family (Ey4) ver, 
to conclude that there exists a field extension E/K and K-homomorphisms 


om: Em 7 E 


for each M € J. We claim that for every finite field extension L/K there exists 
a K-homomorphism from L into E. Indeed, if L = K(q1,...,@m) with each a; 
algebraic over K, consider the homomorphism of K-algebras g: K[X1,..., Xm] > 
L defined by g(X;) = a;. Let M be its kernel. Then g yields an isomorphism 
K[X,,..., Km]/M — L. Hence M is a maximal ideal of K[X,..., Xm], and the 
claim is proved. 

Now let C be the algebraic closure of K in E (see Chapter 2, F8). The ex- 
tension C/K is certainly algebraic; we show that C is also algebraically closed. 
Suppose otherwise. Then there is an algebraic extension F/C with F # C. Take 
a € F\C, and let f be the minimal polynomial of a over K. (Note: a is alge- 
braic over K because it is algebraic over C and C/K is an algebraic extension.) 
Suppose f has exactly distinct roots 61,...,6, in C, and form the subfield 
L= K(a, Bi,..., Bn) of F. Then L/K is finite (see Chapter 2, F7). Therefore, by 
the italicized statement in the previous paragraph, there exists a K-homomorphism 
gy: K(a, Bi,...,Bn) > C. But then g(a), g(B1),...,¢(Bn) aren +1 distinct roots 
of f in C. Contradiction! 


Remark. The extensions Ey7/K are all algebraic (so £/K itself can be assumed 
algebraic); but this is harder to prove and we don’t need it here. 


The proof of part (II) of Theorem 2, the uniqueness part, ensues from: 
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Theorem 3. Let 0 : K — K' be an isomorphism of fields and let L/ K be an algebraic 
field extension. If C is an algebraically closed extension of K', the map o can be 
extended to a homomorphism t : L > C. 


Proof. (a) We deal first with the case K’ = K and o = idx. An application of 
Theorem | to E; := L and E2 := C shows there exists an extension E/K and 
K-homomorphisms 0; : E; — FE such that E = C’(L’), where L’ := o;L and 
C’:=02C. The extension C’(L’)/C’ is algebraic and C’ is algebraically closed. It 
follows that C’ = C’(L’) = E, so 02: C > E is an isomorphism. Then we can look 
at the K-homomorphism o, ' oo, : L + C; this homomorphism is an extension t 
of o = idx as desired. 


(b) Now let o : K — K’ be any isomorphism. As can easily be checked, there is 
an extension L’ of K’ and a homomorphism p: L —> L’ that agrees with o on 
K (replacing the elements of K in L by elements of K’). By (a) there exists a 
K'-homomorphism 1’ : L’ > C. Then t = t’0p: L > C is an extension of o. 


Proof of part (Il) of Theorem 2. By Theorem 3, there is a homomorphism 
t:C,/K > C2/K. But C,/tC, is algebraic (because C/K is), and tC; is 
algebraically closed (like C;). It follows that Cp = tC), so tT is an isomorphism. 


F3. Every endomorphism of an algebraic field extension E/K is an automorphism. 


Proof. Leto: E/K — E/K be a homomorphism of field extensions. We must show 
that o£ = E. Take w € E and set f = MiPox(a). Denote by Ng the set of all 
roots of f in &. Then o effects a permutation of Ny, because o takes roots of f 
to roots of f (see Fl) and a is injective. Thus, since w € Ng, there exists B € Ny 
such that o(B) = a. 


Definition 3. Let K be a field and f € K[X] a nonconstant polynomial. An exten- 
sion E of K is called a splitting field of f over K if there exist @1,02,...,Q@, € E 
such that f(X) = y(X¥ —a1)(X —a@2)...(X —a@y) and E = K(a1,02,..., Qn). 


The name echoes the expression “to split into linear factors”, which means the 
same as “to have a complete decomposition into linear factors”. 


F4. Every nonconstant f € K[X] has a splitting field over K. If E, E' are splitting 
fields of f over K, the extensions E’/K and E/K are isomorphic. 


Proof. Existence: Let C be an algebraic closure of K, which exists by Theorem 2. 
In C[X] we have 


S(X) = y(X —a)(X — a2)... (X¥ — an) 


with aj € C (and y € K the leading coefficient of ). Therefore the subfield 
K(a1,Q@2,...,Qn) of C is a splitting field of f over K. (Actually the existence of 
a splitting field also follows easily from Kronecker’s Theorem, given as Theorem 4 
in Chapter 3; see §3.3 in the Appendix.) 
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Uniqueness: Let C and C’ be algebraic closures of E and E’ (and therefore 
also of K and K’). By Theorem 3 there exists a K-homomorphism 1: E > C’. 
Clearly, since FE is a splitting field of f over K, so is TE; but since the splitting 
fields tE and E’ of f over K are both subfields of C’, it follows that rE = E’. 
Thus t yields a K-isomorphism between F and E’. 


Examples. (a) One splitting field of ¥4 —2 over @ is the subfield Q(</2,i) 
of C. Indeed, ¥4—2 = (X¥ — Y2)(X¥ —i/2)(X + 2)(X +1472) and 
Q(172,i 4/2) = Q( V2, i). 

(b) One splitting field of X”—1 over Q is the subfield E := Q(¢) of C, where = 
e27t/" Indeed, X" —1 = []ja1(¥ —&/) in ELX], because 1,¢,67,...,6"7! 
are all distinct. 


Definition 4. An algebraic field extension E/K is called normal if every irreducible 
polynomial f € K[X] that has some root in E splits into linear factors over E (in 
other words, E contains a splitting field of f over K). 


Theorem 4. Let E/K be an algebraic field extension, and let C be an algebraic 
closure of E (and therefore also of K). The following statements are equivalent: 
(i) E/K is normal. 
(ii) For every homomorphism o : E/K —> C/K we have cE = E (that is, 0 can 
be regarded as an automorphism of E/K). 
(ii) Every automorphism of C/K restricts to an automorphism of E/K. 
(iii) E is a splitting field over K; that is, there exists a set M © K[X] of nonconstant 
polynomials such that E = K(N), where N denotes the set of all roots of 
polynomials f € M inC. 


Proof. (iii) = (ii’): Let 0 : C/K — C/K be an isomorphism. Then o(NV) C N, so 
o(K(N)) C K(N), that is, o£ C E. Now F3 shows that oE = E. 


(ii’) => (ii): By Theorem 3, a homomorphism o : E/K — C/K can be extended to 
a homomorphism t : C/K —> C/K. By F3, t is an automorphism of C/K. 


(ii) > (i): Let f € K[X] be irreducible and suppose f(a) = 0 for some a € E. 
We must show that all roots of f in C already lie in E. So suppose f(6) = 0 for 
some B € C. By FI there is a K-isomorphism o : K(a~) > K(f) taking a to 6. By 
Theorem 3, o can be extended to a homomorphism t : EF — C. By (ii) we have 
tE = E. In particular B = oa = ta is an element of E. 


(i) > (iii): Set M = {MiPox (a) | a € EF} and let N be the set of 6 € C that are 
roots of polynomials f € M. By definition, E C N. If E/K is normal, we have 
N CE. Putting it all together we get E = N, so E = K(N). 


Remark. If E/K is a finite normal extension, we can obviously take M = { f} 
in (iii), with an appropriate choice of f € K[X]. (However in general it cannot be 
stipulated at the same time that f be irreducible.) 


As aconsequence of implication (111) > (i) of Theorem 4, we can state explicitly: 
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F5. Let f € K[X] be a nonconstant polynomial and let E be the splitting field of f 
over K. Then E/K is normal. 


The proof of the next statement is left to the reader as an exercise. 


F6. Let E/K be an algebraic extension. There is an extension E’ of E with these 
properties: 

(i) E'/K is normal. 

(ii) If L is an intermediate field of E'/E and L/K is normal, L = E'. 
Such an E" is called a normal closure of E/K. Any two normal closures of E/K 
are isomorphic as K-extensions. If E/K is finite, so is E'/K. 


2. We now equip ourselves with an important algebraic tool, which we will use to 
prove Theorem | among other things. Fix a field K. If / is a nonempty set, define 


KM :=K™) ={f:M—>K| f(u) =0 for almost all pe M}. 


There is a natural K-vector space structure on KM, with a canonical basis (€,) wem 
populated by the characteristic functions e,, of one-point sets {4} C M (defined by 
é,(v) = 1 if w=v and e,(v) = 0 otherwise). With the identification e,, = yu, every 
ff € KM has a unique representation of the form 


(3) f= ye Cuft with cy € K, 
ueM 


where c, = 0 for almost all wp ¢ M. 

Now give M a monoid structure, that is, an associative operation (written mul- 
tiplicatively) having an identity element. Then KM acquires a natural K-algebra 
structure, whereby the multiplication (u,v) + uv is extended distributively to all 
of KM. (When M = G is a group we call KG the group algebra of G over K.) 

We consider an application. Let (A;)jez be a family of K-algebras with unity, 
where I 4 ©. Set 


M= {(ai)ier |a; € Aj, a; = 14, for almost all iy 


By setting (a;);(b;); = (aibi); we make M into a monoid, whose corresponding 
monoid algebra we denote by KM. Elements of KM have unique representations 


of the form 
ye Cat, Cae K, 
a=(a;)eEM 


with cg = 0 for almost all a € MM. We now wish to construct a quotient algebra of 
KM where certain relations are obeyed. Take the K-subspace U of KM generated 
by all elements of the form 


(4) (a;)+(b;)—(s;), where aj +5; =s; for some j € J and a; =; =; for all other i’s 
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and those of the form 
(5) (a;) —c(6;), where a; = cb; for some j € J and a; = 5; for all other i’s. 


It is easy to see that U is in fact an ideal of KM, so we can take the quotient 
of KM modulo U. We denote this quotient by 


(6) () 4i = KM/U. 
iel 
If z is the quotient map, we set 


®; ai :=a(a) fora = (aj). 


If J = {1,2,...,}, we also write 
Ay @-+-@ An:= &) Ai. a1 @+++ @an := ®; ai. 
iel 


The K-algebra (6) is called the tensor product of the K-algebras A;, fori ¢ J. All 
its elements have the form 


>" ca(®&; a1), 
a=(a;)eEM 


but this representation is no longer unique in general. For each j € J there is a map 
Oj: A ‘hed ®&) A i 
i 


taking a € A; to the element &) a; defined by aj = a and a; = 14, fori # j. 
By definition, oj(14;) is the unity element in ®&), 4;, and for all a,b € Aj we 
have o;(ab) = 0;(a)o;(b). As can easily be seen from (4) and (5), we also have 
oj(a+ b) = oj; (a) +.0;(b) and oj; (ca) = coj(a) for c € K. Thus every o; is a 
homomorphism of K-algebras. This whole construction was aimed at showing that 
(6) yields a K-algebra, which, together with the 0; defined above, enjoys certain 
functorial properties: 


F7. Let (gj )ier be a family of K-algebra homomorphisms yg; : Aj > A. Then if 
(7) gi lay; (b) = 9 (6) gi) 
for alli # j andalla € A;,b € Aj, there exists a unique homomorphism of K- 
algebras 
Q: ®&) Aj >A 
iel 
such that poo; = g; foralli € I. 
Proof. (a) Suppose ¢ has already been found. Then 
9(®, 4) = o(T],; (ai) =], gor (ai)) = TT; (ai), 


so y is uniquely determined. Note that the expression |], ¢;(a;) is well-determined 
because of assumption (7). 
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(b) It is clear from (a) how @g is to be defined; but we must prove that the definition 
is consistent. The map (a;) + [], ¢;(a;) from M into A is multiplicative and thus 
can be extended to a homomorphism of K-algebras y: KM — A satisfying 


(E cat) = xe (I pia). 


It is easy to ascertain that elements of the form (4) and (5) are in the kernel of w. 
Thus y(U) = 0, so w gives rise to a K-algebra homomorphism 9: KM/U > A 
such that 


(8) 9(®), ai) =]; gi (a). 


In particular, p(o;(a)) = 9; (a) for all a € A;, which proves the assertion. 


Remark. All of this continues to work if K is a commutative ring with unity rather 
than a field. The only change is that when K is a ring we talk of K-modules rather 
than K-vector fields (though the name K-algebra remains). Now, not all K-modules 
are lucky enough to be free, that is, to contain a set of K-linearly independent 
elements whose K-linear combinations make up the whole module. If such a set 
exists, itis called a basis. Our KM does have a basis —in fact a canonical basis, the 
set (€u)wem (page 60). Thus KM is a free K-module. Other constructions so far in 
this section also work just as well in this more general setting, as the reader should 
check, with only the change that U is a K-submodule of KM. But watch out: our 
vector space intuition does not work so well with modules, and it is possible, for 
example, for U to be the whole of KM! So the tensor product (6) might be the 
zero ring, even if all the A; are nonzero. This does not happen, however, in the 
following special situation: 


F8. Let K be a commutative ring with unity and suppose (Aj)ier are K-algebras 
such that each A; has a K-basis M; with 14, € M;. (Un particular, the A; are free 
K-modules.) Then the family of elements of the form 


(9) &; bi with bj € Mj and bj = 14, for almost all i 
is a K-basis of the tensor product ®j<, Ai. 


Proof. Clearly the elements (9) span ®),<; Ai as a K-module. Let a = ®) ai be a 
fixed but arbitrary element among those in (9). For each i € J, let f; : Aj > K be 
the linear functional that assigns to each a € A; the coordinate of a corresponding 
to the basis element a;. Then consider the K-linear map 


ha: (Ai > K 
i 


such that he (®, x}) =]; fii). This map is well-defined. For an arbitrary ele- 
ment 6 = ®), b; of the form (9) we have 


ma) =ThO={1 iene ay 
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Now suppose we have a linear dependence 
oz cp(@; bi) =0, 
B=(b;) 


where the sum is over the elements 6 = (b;); in (9). Applying he we see that cg = 0. 
Thus the elements (9) are linearly independent over K. 


Here is an immediate consequence of F8: 


F9. We keep the assumptions of F8. The multiplicative identity 1 = @); 1a, of 
&) Ai is nonzero. For every j the map o; : Aj > @); Ai is injective; hence A; can 
be regarded as a subalgebra of &); Ai. 


Example. Let % be a nonempty set. For each X € #, let Ay := K[X] be the 
polynomial ring in one variable XY over K. Then 


K[#]:= &) Ax 


XEX 


is called the polynomial ring in the variables X € X over K. The elements of K[#] 
are called polynomials in the variables X € #. We regard K[X] as a subalgebra of 
K[#]. In view of F8, the family of monomials 


My (%):=[[X"®, with v = (v(X))x € (NU {0)™, 
x 
forms a basis of K[2#]. 


If X= {X,,..., Xn} has n elements, we set K[X1,..., X,]:= K[%]; in this case 
the basis representation of an f € K[%] has the form 


Le 3 
V=(V15-.50n) 


The following functorial property comes directly from the definition of K[%] 
together with F7: 


F10. Let K be a commutative ring with unity and & a nonempty set. If A is a com- 
mutative K-algebra, any map & — A can be uniquely extended to a homomorphism 
of K-algebras y : K[X|— A (the substitution homomorphism). 


If X#={X1,..., Xn} and aj := g(X;), denote by f(a1,..., an) the image o(/) 
of a polynomial f € K[X1,..., Xn] under g. In particular, f(%1,..., Xn) = f. 
Note also that K[X1,..., Xn] = K[X1,..., Xn—1][Xal- 


3. We now turn to an important algebraic application of Zorn’s Lemma. 


Zorn’s Lemma. Let M be a (partially) ordered set in which every chain has an 
upper bound. Then M has a maximal element. 
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Recall that a set M is called partially ordered, or simply ordered, if it is endowed 
with a relation < satisfying the following properties: (i)a<a; (ii)a<bandb<c 
imply a<c; (iii)a <b and b <a imply a =D. Let (M, <) be an ordered set. A 
subset N of M is a chain if it is totally ordered with the order induced from M; 
that is, if fora, b € N we either have a < b or b <a. Again let N C M. An element 
a € M is an upper bound for N if x <a for every x € N. Finally, me M isa 
maximal element of M if any m' € M such that m < m’' actually equals m. 

We take Zorn’s Lemma as a well-known fundamental statement of set theory; 
see for example the Wikipedia entry at http:/en.wikipedia.org/wiki/Zorn’s.lemma. 
As a typical example of its application, consider: 


F11. Let V be a vector space over a field K, and let T be a linearly independent 
subset of V. Then V has a basis B such that T © B. 


Proof. The set M of all linearly independent subsets of V containing T is ordered 
by inclusion. It is also nonempty, since T ¢ M. Let N 4 @ be achain in M. The 
union U of all Y € N is then linearly independent as well. Otherwise there would 
exist a finite subset A of U exhibiting a linear dependence; because A is finite and 
N is a chain, there would exist Y € N such that A C Y. But this is impossible, 
since Y was assumed linearly independent. 

By Zorn’s Lemma, then, there exists a maximal linearly independent subset B 
of V such that T C B. But such a set must be a basis for V. (Why?) 


The following result is important in our context: 


F12. Let I be an ideal ina ring R with unity. If I # R, there exists a maximal ideal 
of R containing I. 


Proof. Consider the set M of ideals J of R such that 1 € J > J, and order MW by 
inclusion. M is nonempty since it contains 7. If N # @ is a chain in M, consider 
the union J of all J € N. One checks easily that J €M; also J is obviously an 
upper bound for NV. By Zorn’s Lemma, then, WM has a maximal element J. From 
the definitions and the fact J # R (since 1 € J) we see that J is a maximal ideal 


of R and J contains J. 


Remark. For noncommutative rings, the exact same proof yields the corresponding 
statement about left ideals (instead of two-sided ideals). 


We are finally ready to pick up our long-awaited proof: 


Proof of Theorem 1. Consider the tensor product A = ®)<, E; of the extensions E; 
of K (regarded as K-algebras), together with the corresponding homomorphisms of 
K-algebras o; : E; — A. Then A is a (commutative) K-algebra with 1 4 0 (see 
F9; but in general A is not a field). By F12, there exists a maximal ideal M in A. 
Therefore A/M is a field, by F14 in Chapter 4. Being a K-algebra, A/M can be 
regarded as an extension E of K. The desired K-homomorphisms 1; : LE; > E are 
obtained by composing the o; with the quotient map A > A/M = E. It is clear 
that E arises from K by adjunction of the union of the sets 1; Ej. 
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Separable Extensions 


1. Definition 1. Let K be a field and C an algebraic closure of K. Two elements 
a, B €C are called conjugate over K if there is an automorphism o of C/K with 
o(a) =p. The elements of C conjugate toa € C over K are called the K-conjugates 
of a (in C). 
F1. With the notations of Definition 1, the following statements are equivalent: 

(i) B and @ are conjugate over K inC. 

(ii) B is a root of MiPox (a). 
(iii) There is an isomorphism t: K(a)/K — K(B)/K such that t(a) = B. 

(iv) MiPox (a) = MiPox (f). 
In particular, any a € C has at most [a: K] = K(a): K distinct K-conjugates in C. 
Proof. (iii) => (i): Extend t to an automorphism o of C/K (Chapter 6, Theorem 3 
and F3). 
(i) > (ii): By assumption there is an automorphism o of C/K such that o(a) = B. 
Set f = MiPox(q@). Then f(6) = f(oa) = 0 (Chapter 6, F1). 
(ii) > (iv): Set f = MiPox (a) and suppose f(8) = 0. Since f is irreducible and 
normalized, f = MiPox (B). 
(iv) = (iii): This follows from Chapter 6, F1(b). 


The last statement of F1 is clear, since f = MiPox (a) can have at most [a: K] = 
deg f distinct zeros in C. 


F2. If K is a field and C an algebraic closure of K, the following statements are 
equivalent for a givena € C with f = MiPox (a): 

(i) @ has exactly n distinct K-conjugates in C. 

(ii) There are exactly n distinct homomorphisms from K(a)/K into C/K. 
(iii) f has exactly n distinct roots in C. 


Proof. A homomorphism 1: K(a)/K — C/K is determined by the image t(q). 
Thus F2 follows from FI. 
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Definition 2. Let E/K be a field extension and suppose a € E is algebraic over K. 
The number of distinct roots of / = MiPox (a) in any splitting field of f over K 
is called the separable degree of a over K, and is denoted by 


la: K]s. 


An element @ is said to be separable over K if [a: K|; = [a: K]; otherwise it 
is inseparable over K. Thus q@ is separable over K if and only if its minimal 
polynomial over K only has simple roots (in its splitting field). 


Remarks. (a) Clearly, [a: K],; <[a: K]. 
(b) [w: K]s is the number of K-conjugates of a in an algebraic closure C of K. 
(c) Any a € K is separable over K. 


So far, so clear. Now we may ask: If a is separable over K, is every B € K(a) 
also separable over K? 


Definition 3. An algebraic field extension E/K is called separable if every element 
of E is separable over K; otherwise E'/K is inseparable. 


We also agree on the following conventions: If E,/K, E2/K are field exten- 
sions, we denote by 
G(E\/K, E2/K) 
the set of all homomorphisms FE, /K — E2/K (see Chapter 6, Definition 1). When 
E, = E, = E we use the abbreviation G(E/K) := G(E/K, E/K). 


Remark. If E/K is algebraic, G(E/K) has a natural group structure (Chapter 6, 
F3). We thus obtain the automorphism group of the algebraic extension E/K. 


Theorem 1. Let E/K be a finite extension, of degree n, and let C be an algebraic 
closure of K. 

() G(E/K, C/K) has at most n elements. 

(II) G(E/K, C/K) has n elements if and only if E/K is separable. 


We postpone for a while the proof of this important theorem; first we bring its 
content to bear: 


Definition 4. Let E'/K be an algebraic field extension and C an algebraic closure 
of K. Then 
[E: Kl» = |G(E/K,C/K)| 


is called the separable degree of E/K. (This number is independent of the choice 
of C; see Chapter 6, Theorem 2(II).) 


Remarks. Let E/K be any field extension, but assume a € E algebraic over K. 
Then 


(1) [K(a): K]s = |a: K]s 
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by FI (see also Definition 2 and Remark (b) following it). It follows further that 
(2) a is separable over K <=> K(a)/K is separable. 


For, by definition, a is separable over K if and only if [w: K], =[a: K]; and because 
of (1), this is equivalent to [K(a): K]; =[K(a): K]. But, by part (ID of Theorem 1, 
this latter equality holds if and only if K(a)/K is separable. Note that (2) answers 
in the affirmative the question posed just before Definition 3. 


We base the proof of Theorem | on the following result: 


Lemma. Let F be an intermediate field of an algebraic extension E/K and let C be 
an algebraic closure of E. Then there exists a bijection 


G(E/K,C/K) << G(F/K, C/K) x G(E/F,C/F). 
Proof. By Chapter 6, Theorem 3, there exist maps 


G(F/K,C/K)>G(C/K) |, G(E/F.C/F) > G(C/F) 


orRG THT 
with Of =o and Tg = Tt. We claim that the map 


G(F/K, C/K) x G(E/F, C/F) > G(E/K, C/K) 


(0,T) (OT)E 


is bijective. First we show injectivity: Suppose (OT) g = (017T,) z. By restriction to 
F we see first that of = (61) F, then that o = oj and therefore 0 = 0;. From the 
assumption it then follows that Tz = (11) z, which is to say t = 7. 

To prove surjectivity it is enough (by Chapter 6, Theorem 3) to prove that, for 
every W € G(C/K), there exist o and t as above, such that we = (OT)x£. For 
a given w set o := Wr and then t:= G 'We. Then o € G(F/K,C/K) and 
t € G(E/F,C/F), because 6~! Wx fixes F pointwise. By definition 6~!we =Tz, 
SO WE = OTE = (OT)E. 


Proof of Theorem 1. We work by induction on n = E: K. For n = 1 the assertion 
is clear. Suppose 7 > 1; then there exists a € E \ K. For F = K(q@) we then have 
E:F <n. By the lemma, 


(3) |G(E/K,C/K)| = |G(F/K, C/K)|-|G(E/F,C/F)|. 


The first factor on the right-hand side is at most Ff’: K because of FI (see also 
Remark (a) following Definition 2). The second factor is at most E': F by the 
induction hypothesis. Thus the left-hand side of (3) is worth at most [F: K]|E: F]= 
[E : K] =n, proving assertion (1). 

For assertion (II), assume E'/K is separable. Trivially, F/K is separable; but 
also E'/F is separable, because for each 6 € E the polynomial MiPor(f), being a 
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factor of MiPox (8), has only simple roots. By the induction hypothesis and (3) we 
conclude that |G(E/K,C/K)| =[F: K][E: F])=[E:K]=n. 
Conversely, assume |G(E /K,C /K)| =n. Taking (3) and (I) into account we 
see that 
|G(F/K,C/K)|= F:K 


An application of F2 to F = K(q) then shows that a is separable over K. Now 
recall that this holds for any a € E € K. Thus E/K is separable (because each 
a € K is separable over K). 


In view of Definition 4 we can reformulate Theorem | as follows: 
Theorem 1’. Let E/K be a finite field extension. Then [E : K], is finite, and 
(4) [E: K], <[E: K]. 

Equality holds if and only if E/K is separable. 


Remark. [E : K]; actually divides [E : K], as can be proved by induction with a bit 
more effort. But this result will follow more easily from F17 below. 


F3. If F is an intermediate field of an algebraic extension E/K, 


(5) [E: K], =[E: F]s-[F: K]s. 


Proof. This is an immediate consequence of the lemma. 


F4. Given an extension E/K, there is equivalence between: 
(i) E/K is finite and separable. 


(ii) E is generated over K by finitely many separable algebraic elements a1,..., 
An € E. 


Proof. (ii) > (i): E/K is clearly finite (Chapter 2, F7). By induction on m and 
using Theorem 1’ we get E: K =[K(a1,...,0@m): K(a1)]s-[K(a1): K]s, since a 
is separable over K. Then F3 implies that [E: K] =[E: K],, so E/K is separable 
by Theorem 1’. 


The implication (i) > (ii) is obvious. 


F5. Consider an algebraic extension E/K. Then 
E,={a¢€E|a separable over K} 


is an intermediate field of E/K, called the separable closure of K in E. 


Proof. For a, B € E's, consider the subextension K(a, B)/K of E/K. By F4 this is 
separable; therefore K(a, 6) C Es, and the assertion follows. 


F6. Let E/K be an algebraic field extension and let A be a subset of E. If A only 
contains elements separable over K, the extension K(A)/K is separable. 
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Proof. By F5 we have K(A) © Es. 


F7 (Transitivity of separability). Let L be an intermediate field of an algebraic field 
extension E/K. If both E/L and L/K are separable, so is E/K (and conversely). 


Proof. In the case of finite extensions the assertion is clear; see F3 and Theo- 
rem 1’. The general case is reduced to the finite case as follows. For B € E, 
let @o,@1,...,@y,—1 be the coefficients of f/ = MiPoz(f). Then / is also the 
minimal polynomial of 6 over the subfield F := K(ao,0,...,Q@,—1). Since B 
is separable over L by assumption, f has only simple roots. This implies the 
separability of F(8)/F. But in view of F4, F/K too is separable, because by 
assumption @,@1,...,@,—1 are separable over K. 


2. And now at long last: 


Definition 5. Let K be a field. A polynomial f € K[X] of degree n > 1 is called 
separable if it has n distinct roots in the splitting field of f over K. 


Remarks. (a) Let L/K be an extension and let a € L be algebraic over K. In view 
of the definition, @ is separable over K if and only if the minimal polynomial of a 
is separable over K (see F2). 


(b) Let f € K[X] be any nonconstant polynomial and let E by a splitting field of 
f over K. The prime factorization of f in EX] has the form 


F(X) = y(X — 01) (X — 012) + (Kay), 
with the a; all distinct. Then / is separable if and only if all the e; equal 1, that is, 
f has no multiple roots. 


(c) In analysis, the differential calculus is a useful tool for dealing with multiple 
roots. In algebra we make do with the following formal differential calculus on 
polynomials: Given a polynomial f = Yj-0 aX! in K[X], the (formal) derivative 
of f is f’ := '_,ia;X'"!. The map K[X] > K[X] defined by fh /f’ is 
obviously linear: 


(af + bg) =af' +be', 


and satisfies the product rule with respect to multiplication: 


(fey = f'gt+ea'f. 


(Because of linearity this just has to be verified in the case that f = X‘ and g = Y/ 
are monomials.) 


F8. With the preceding notation, suppose that f(a) =0 for some a € E. Then a is 
a multiple root of f if and only if f’(a) = 0. 


Proof. By assumption we have f(X) = (X¥ —a@)*g(X) in E[X], with e > 1 and 
g(a) 4 0. Differentiation gives 


f'(X) = e(X — a)" g(X) + (X —@)?g"(X). 


Substituting w of Y shows that f’(a) =0 <— > e>2. 
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F9. A polynomial f € K[X] is separable if and only if f and _f"' are relatively prime 
in K[X]. 


Proof. Let E be a splitting field of f over K. By F8, f being separable is equivalent 
to f and f’ being relatively prime in E[X]. But for polynomials in K[X], relative 
primeness in EX] is the same as in K[X]; see §4.7 in the Appendix. 


F10. If f € K(X] is irreducible, then f is separable if and only if f’ # 0. 


Proof. We use F9. If f, f’ are relatively prime, f’ 4 0. Now suppose instead that 
f, f’ are not relatively prime in K[X]. Because / is irreducible, f must divide /’; 
but since deg f’ < deg /, this is only possible if f’ = 0. 


F11. Let K be a field of characteristic 0. Every irreducible polynomial f € K[X] is 
separable. Thus every algebraic extension E/K is separable. 


Proof. Write f = )-7-) aiX", so that f’ = )-7_, ia; X'~!. Saying that f’ = 0 is 
the same as saying that 


(6) ia; =0 forall 1 <i <n. 


But if char K = 0 this condition is only satisfied if a; = az =... =a, = 0, that is, 
if f = do is a constant polynomial. 


3. We now turn to the case char K = p > 0. Condition (6) is then equivalent to 
ai =0 for alli such that i 40 mod p. 

As a consequence: 

F12. [fchar K = p > 0, 

(7) f=0 => fek[X?] 


for f € K[X]. Thus, if f is assumed irreducible, it is separable if and only if does 
not lie in K[X?]. 


Thus, over a field of characteristic p > 0 there exist nonconstant polynomials 
with zero derivative; any polynomial of the form 


cote X? +e.X72P 4...4 XP 


(for & = 1) has this property. (Whether there really are any irreducible polynomials 
in K[X?] is a question we have not broached yet.) 


F13. Let E/K be a field extension with char K = p > 0. Ifa € E is algebraic over 
K, there exists an integer m = 0 such that a?” is separable over K. 
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m 


Proof. Set f = MiPox(q@). Clearly there is an integer m > 0 such that fe K[X? ] 
but f ¢ Kixey: Thus there is a polynomial g € K[X] such that 


(8) f(X) = g(X?") but g ¢ K[X?]. 


We wish to show that g is irreducible. Suppose g(X) = /1(X)/A2(X) in K[X]. 
Then g(X?”") = hy(X?")hy(X”"). But f in (8) is irreducible, so hy € K* or 
hy € K*. Since g(a?) = f(a) = 0 we then have g = MiPog(a””). By F12, g 
is separable. 


F14. For an arbitrary field K of characteristic p > 0, the map 
(9) ara? 
of K into itself is a homomorphism; in particular, a? = 1 in K if and only if a = 1. 


Proof. Obviously, (a8)? =a? B? and 1? = 1. Also, by (15) in Chapter 5, (a+ 6)? = 
a? + B?. For the last assertion see again Chapter 4, F15. 


Remark. In the case of a finite field K of characteristic p, the injective map (9) 
is also surjective. Thus we get an automorphism op of K. Consequently, for any 
power g = p” of p, the map o, : a — a is likewise an automorphism of K, since 
Oq is the n-th power of op in Aut K. 


Definition 6. An algebraic extension E/K is called purely inseparable if any a € E 
not belonging to K is inseparable over K. 


F15. Let E/K be purely inseparable with E #4 K (which implies that char K = p > 
0). Given any a € E one can find values of m € N such that a?” € K; ifm is taken 
as the smallest such integer, 


(10) De ee 
is the minimal polynomial of a over K. 


Proof. By F13 there exists m > 0 such that a?” is separable over K, and hence lies 
in K by the inseparability assumption. Let m be minimal with this property, and 
let f(X) the polynomial (10). Then f(X) € KLX]. But f(X) = (X —a)?” since 
char K = p, so if g is an irreducible factor of f in K[X], the prime factorization 
of f in K[X] must have the form 


f(X) =8(X), with 1<j <p”. 
But p” = deg f = j deg g, so deg g = p” with n <m. Then 
g(X) =(X —a)?" = X?"— a", 


so a?” € K because g(X)€ K[X]. But m was assumed minimal, so n = m. Thus 
f(X) = g(X), and f is irreducible. 
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F16. Let E/K be an algebraic extension and C an algebraic closure of E. The 
following statements are equivalent: 


(i) E/K is purely inseparable. 

(ii) [E: K]y = 1. 
(ili) Every K-homomorphism o : E — C fixes E pointwise. 
(iv) No a € E is conjugate over K to B € C distinct from a. 


Proof. (i) > (ii): Take o € G(E/K, C/K) anda € E. By F15 we have y:=a?""€ K 
for some m € N. An application of o yields 


o(a)?" =o(a?") =o(y)=y =a?”, 
which by F14 implies o(a) = a. Thus G(E/K,C/K) contains a single element, 
the inclusion EF > C. 
(ii) => (iii) and (iii) > (iv) are clear from the definitions. 


(iv) => (i): Take a € E and f = MiPox(qa). All roots of f in C are conjugate to 
a over K. By the hypothesis, then, a can only be separable over K if deg f = 1, 
that is, ifa € K. 


Remark. Let F be an intermediate field of an algebraic extension E/K. By F3 we 
have [E: K]s =[E: F]s-[F: K]s. It follows from this and F16 that E/K is purely 
inseparable if and only if E/F and F/K are purely inseparable. 


F17. Let E/K be a algebraic extension and E; the separable closure of K in E. 
(a) E/E; is purely inseparable. 
(b) If Es/K is finite, so is [E: K|s; moreover [E: K]s = Es: K. In particular, if 
E/K is a finite extension, |E: K]s divides E: K. 
(c) If E/E; is finite, E: Es is a power of p = char K. 


Proof. (a) Let a € E be separable over Es. Then E;(a)/ F's; is separable, and by F7 
so is E(a)/K. It follows that E';(a) C Es, that is, a € Es. 


(b) By F3 we have [E: K]s = [E: Es]s-[Es: K]s. By part (a) and F16, the first 
factor must equal 1. The second coincides with E,: K, since E,/K is separable 
and, by assumption, finite. (By the way, from [E’: K]s; < oo it follows conversely 
that Es: K <oo. Think about it for a while. A proof can be given using Theorem 3 
in Chapter 8, and is left as an exercise; see also §8.22 in the Appendix.) 


(c) We must show that the degree of a finite, purely inseparable extension E/K is 
a power of p. Take a € E~ K. Since E/K(a), too, is purely inseparable (see the 
preceding Remark), it can be assumed by induction that EF: K(a) is a power of p. 
By F15, K(a@):K is also a power of p, and thus so is E: K =[E: K(a)]-[K(qa): K]. 
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Remark. Keep the notation of F17. Then 
(11) [E: K]i:= EB: Es 
is called the inseparable degree of E/K. (See also §7.9 in the Appendix.) 


Definition 7. A field K is called perfect if every algebraic extension over K is 
separable. 


By F11, every field of characteristic 0 is perfect. Assume char K = p > 0. Let 
(12) K? :={a? |ae K} 
be the set of p-th powers of elements in K. By F14, this is a subfield of K. 
F18. If K is perfect and K'/K is an algebraic field extension, K' is also perfect. 


Proof. Let E/K’ be any algebraic extension. Since E/K’' and K’/K are algebraic, 
so is E/K. Since K is perfect, E/K is separable. Then E/K’ is also separable. 


F19. A field K with char K = p > 0 is perfect if and only if K? = K. In particular, 
every finite field is perfect. 


Proof. Let K be perfect. For a given a € K, let E be a splitting field of the 
polynomial f(X) = X? —a@ over K. In E this polynomial has a root, that is, there 
exists a 6 € E such that B? =a. Since the map x b x? of E into itself is a field 
homomorphism and hence injective, f has only this one root 8 in E (alternatively, 
this follows from the equality ¥? — B? = (X — B)”). Thus f(X) = (X — B)?. 

Now set g = MiPox (6). By assumption, g is separable. But g divides /, so it 
must be of the form g(X) = X¥ — B. Thus Be K. 


Conversely, assume K = K?. Suppose there is an inseparable irreducible poly- 
nomial f € K[X]. By F12, it must have the form f(X) = g(X”), with g € 
K[X]. Since K = K?, we have g(X) = )°b? X' with b; € K. It follows that 
f(X) = g(X?) = Vo bP XP! = (9b; X")”; that is, f is not irreducible in K[X], 
contradicting the assumption. 


Finally, if K is finite, K? = K by the Remark following F14. 


Remark. As an example of a nonperfect field, consider K = F,(t), the field of 
rational functions in one variable over the field F, with p elements. Indeed, the 
polynomial f(X) = X? —t € K[X] is irreducible (Eisenstein’s criterion, F10 in 
Chapter 5), but not separable (F12). Associated with f is the purely inseparable 
extension K(t!/?)/K of degree p (where t!/? is defined as the unique p-root of ¢ 
in a fixed algebraic closure of K). 
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Galois Extensions 


1. Let E be a field. If G is a group of automorphisms of E, the set 
E® :={a¢€ E|oa=a forall cE G} 
is called the fixed field of G. (It is clear that E© really is a subfield of E.) 


F1. Let E be a field and G a group of automorphisms of E; denote by K = E® the 
fixed field of G. Take a € E. If the set 


Ga:={oa|oeEG} 


is finite, a is algebraic over K. If Ga contains exactly n distinct elements, say 
Q1,Q2,...,Qy (one of them being a), the minimal polynomial of a over K is the 
separable, normalized polynomial 


n 


(1) f(X) =|] (X-«). 


i=1 


Proof. Each t € G gives rise to a ring isomorphism t : E[X] > E[X], sending 
g(X) = >° 5; X! to g*(X) = > 1(b;) X". Thus the polynomial / in (1) satisfies 


(2) f7(x) = [[@-7@)). 


i=1 


But tT gives rise to a permutation of a1, 02,...,Qn, Since for every o € G we have 
t(oa) = (to) a € Ga, and plus Tt is injective. Thus (2) implies that f*(X) = f(X) 
for every t € G. By definition, then, all the coefficients of f already lie in the fixed 
field K of G. But @ is a root of f(X) € K[X], and therefore algebraic over K. 
Let g = MiPox (a). Then in any case g divides f in K[X]. Since g(a) = 0 we 
have g(oa) = 0 for every o €G, that is, g(a;) = 0 fori = 1,2,...,”. Thus g has 
at least degree n. It follows that f = g, since both polynomials are normalized. 
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Definition 1. An algebraic field extension E/K is a Galois extension if 
(3) K = E&E/K) 

The group G(E/K) is then called the Galois group of E/K. 


Remarks. (a) For an arbitrary extension E/K, it is clear that K C E&(4/*), 
(b) Let G be a group of automorphisms of a field E, with fixed field K = E®. If 
E/K is algebraic, it is a Galois extension. This is because G C G(E/K) by 
definition, and hence E&(4/5) c E© = K. Because of (a) equality (3) holds. 


Theorem 1. For an algebraic extension E/K there is equivalence between: 
(i) E/K is Galois. 
(ii) E/K is normal and separable. 


Proof. (ii) > (i): Let a be any element of F not lying in K. We must show that 
there exists t € G(E/K) such that ta #4 a. Let C be an algebraic closure of E. 
By assumption, a is separable over K, so [K(a): K]; = K(a): K #1. Thus there 
exists o € G(C/K) such that ow 4 a (see Chapter 7). Since E/K is normal, o 
restricts to an automorphism t € G(E/K) by Chapter 6, Theorem 4. We thus have 
Ta =oa # as required. 


(i) => (ii): Take a € E and set f = MiPox (a). We must show that f is separable 
and that it splits into linear factors over E. But this follows immediately from F1, 
with G = G(E/K). The necessary assumption that {oa |o € G} be finite is satisfied 
since all the ow are roots of /. 


F2. Let f € K[X] be a separable polynomial with splitting field E over K. Then 
E/K is a finite Galois extension. (Instead of assuming that f is separable, it is 
enough to assume that the prime factors of f in K[X] are separable.) 


Proof. E/K is normal, according to F5 in Chapter 6. Let a1,...,@, be the roots of 
f in E. Then FE = K(qj,...,@,). Each a; is separable over K (see Definition 2 
in Chapter 7). Thus £/K is finite and separable, by F4 in Chapter 7. 


Theorem 2. Let E'/K be a Galois extension. For every intermediate field F of E/K, 
the extension E/F is also Galois: 


F = EGE/F). 
The map 
Fw G(E/F) 


from the set of intermediate fields of E/K into the set of subgroups of G(E/K) is 
therefore injective. 


Proof. Clearly E/F is algebraic, separable and normal if E/K has each of these 
properties. Thus E'/F is a Galois extension, by Theorem 1. 
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Is F/K a Galois extension in the situation of Theorem 2? In general, no. True, 
F/K is trivially separable, but it need not be normal, because it is not necessarily 
the case that oF C F for each o € G(E/K). 


F3. If F is an intermediate field of a Galois extension E/K, 
(4) G(E/oF)=o0G(E/F)o' for any o € G(E/K). 


Moreover, the following statements are equivalent: 
(i) F'/K is a Galois extension. 
(ii) oF = F forallo € G(E/K). 

(iii) oG(E/F)o—! = G(E/F) for allo € G(E/K). 


Proof. To prove (4), note that, for an arbitrary t € G(E/K), 


T€G(E/oF) <> t(oa) =o for allae F <> (ao !t0)a=a for all we F 
<=> o 't0 €G(E/F) <= 1€0G(E/F)o!. 


(Gi) <= > (ii): In view of (4) and Theorem 2 we have 
oF =F <> G(E/F)=G(E/oF)=o0G(E/F)o"!. 


(i) < > (i): In view of Theorem 1 we just have to prove that (ii) is equivalent to 
F/K being normal (F/K is separable in any case). Let C be an algebraic closure 
of E (and so also of F and K). We use the normality criterion (ii’) of Chapter 6, 
Theorem 4. 

Any o € G(E/K) extends to some t € G(C/K) (Chapter 6, Theorem 3 and 
F3). If F/K is normal we get tF = F, hence oF = F. 

Conversely, if of = F for all o € G(E/K), it is also true that tF = F for all 
t € G(C/K), since any t € G(C/K) restricts to o € G(E/K) (by the normality of 
E/K). Therefore F/K is normal. 


The proof shows that if E/K and F/K are normal extensions with F C E, and 
C is an algebraic closure of E, there is a commutative diagram 


G(C/K) —> G(E/K) 


Boe 


G(F/K) 


(5) 


with well defined homomorphisms r, p, r’, each of them surjective. 
Definition 2. Let G be a group. A subgroup H of G is called normal if 
oHo |=H foralloe€G. 


Thus the notion of a normal subgroup comes up quite naturally in the study of 
fields; but it is also a key notion in group theory: see the remarks following the 
Fundamental Homomorphism Theorem (F3 in Chapter 3). 
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F4. We keep the assumptions of F3. 
(a) F/K is a Galois extension if and only if GE/F) is a normal subgroup of 
G(E/K). 
(b) If F/K is Galois, the natural map p: G(E/K) > G(F/K) gives rise to an 
isomorphism of groups 
(6) G(E/K)/G(E/F) > G(F/K). 
Proof. Part (a) is clear from F3 and Definition 2. We prove (b). We know that p is 
surjective. By definition, 


ker p= {o € G(E/K) | ox =x forallxe F} = G(E/F). 


We conclude by invoking the Fundamental Homomorphism Theorem for groups 
(remarks following F3 in Chapter 3). 


F5. Let E’ be a normal closure of an algebraic extension E/K see (Chapter 6, F6). 
If E/K is separable, so is E'/K (and therefore E'/K is Galois). 


Proof. Suppose E = K(B). In an algebraic closure C of E’, let A be the set of all 
roots of minimal polynomials of a € B. Then E’ = K(A). If E/K is separable, 
every element of A is separable over K. By F6 in Chapter 7, this implies E’/K 
separable. 


2. We now wish to study particularly the implications of finiteness. 
F6. If a Galois extension E/K is finite, the Galois group of E/K is also finite, with 
IG(E/K)| = E:K. 


Proof. Let C be an algebraic closure of E. Since E'/K is separable, [E: K] = 
[E: K]s = |G(E/K,C/K)|. Since E/K is normal, we can identify G(E/K, C/K) 
with G(E/K, E/K) = G(E/K) (Chapter 6, Theorem 4). 


Theorem 3 (Primitive element theorem). A field extension E/K that is finite and 
separable is also simple, that is, E = K(a) for somea € E. 


Proof. In view of Chapter 3, Theorem 5, it suffices to show that E'/K has only 
finitely many intermediate fields. Let E’ be a normal closure of E/K. If E’/K 
has only finitely many intermediate fields, so does E/K. By F5, E’/K is a Galois 
extension; it is also finite by F6 in Chapter 6. Thus we might as well assume that 
E/K is Galois to begin with. Then G(£/K) is a finite group (F6) and as such has 
only finitely many subgroups. By Theorem 2 this means that E/K has only finitely 
many intermediate fields. 


Theorem 4. Let E be a field and G a finite group of automorphisms of E.,, with fixed 
field K = E®. The extension E/K is finite and Galois; moreover, 


G =G(E/K), 
that is, G coincides with the Galois group of E/K. 
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Proof. In view of Fl, E/K is algebraic, separable and normal. Thus it is a Galois 
extension. Let d := |G| be the order of G. By F1 we know at first only that 


(7) K(a):K<d_ foreachae E. 


There is certainly some a € FE for which K(a): K is maximal. Take any 6 € E. 
By Theorem 3, there exists y € E with K(a, 6) = K(y). From our choice of a 
we have K(y): K < K(a): K; therefore K(a) = K(y) 3 B. Thus E = K(q). In 
particular, E'/K is finite, and (7) becomes 


(8) E:K<d=|Gl. 


The inclusion G C G(£/K) is trivial. But |G(E/K)| = E:K by F6, so in view of 
(8) we must have G = G(E/K). 


For another justification of (8) see §12.6 in the Appendix. 


Theorem 5 (Fundamental theorem of Galois theory for finite Galois extensions). 
Let E/K be a finite Galois extension. Then the map 


(9) Fw G(E/F) 


is a bijection between the set of intermediate fields F of E/K and the set of sub- 
groups H of G:= G(E/K). Each extension E/F is Galois and satisfies 


(10) E: F=|G(E/F)|. 
Moreover there is an equivalence 

(1) FiCF, => G(E/F\) 2 G(E/F). 
The inverse of (9) is the map 

(12) Ht E” = the fixed field of H. 


F'/K is a Galois extension if and only if GE /F) is anormal subgroup of G. If F/K 
is Galois, one obtains by restriction a natural isomorphism 


(13) G(F/K) = G(E/K)/G(E/F). 
Proof. Theorem 2 says that, for every intermediate field F of E/K, 
(14) BOO) ae 


Since £/K was assumed finite, G = G(E'/K) is finite by F6, and hence so is any 
subgroup of G. By Theorem 4, then, 


(15) G(E/E")=H 


for any subgroup H of G. Thus, by (14) and (15), the maps (9) and (12) are indeed 
inverse to each other. The remainder of Theorem 5 is now clear (look again at 
Theorem 2, F6, and F4). 


80 8 Galois Extensions 


3. As can easily be seen from Theorem 3, if E/K is a finite Galois extension, E 
is the splitting field of a separable (and irreducible) polynomial f € K(X] over K. 
By F2 the converse also holds: If f € K[X] is a separable polynomial and E is 
a splitting field of f over K, then E/K is a finite Galois extension. The following 
definition is therefore apposite and convenient (and is close to the original definition 
of a “Galois group”; see Evariste Galois, Guvres mathématiques, Paris, 1897): 


Definition 3. Take f € K[X]. Assume that f is separable (or just that all prime 
factors of f are). Let E be a splitting field of f over K. The Galois group of 
the finite Galois extension E'/K is also called the Galois group of f over K, or the 
Galois group of the equation f(X) = 0 over K. 


Example. The Galois group of ¥? — 2 = 0 over Q is isomorphic to the symmetric 
group S3. 

Proof: X3—2 has three distinct roots a1, @2,@3 in C. Let G be the Galois group 
of ¥?—2 over Q. Every o € G permutes @;, &2, 03. Thus one gets a homomorphism 


G > $3. 


This map is injective, since o is uniquely determined by its action on a1, 2,03. 
Therefore G is isomorphic to a subgroup of S3. Now let EF := Q(a1,a2,a3) = 
Q(</2, ¢3) be the splitting field of X3 —2 over Q; then E:@ = 6 (see F10 in 
Chapter 5 and F11 in Chapter 2). Therefore G ~ S'3. 


Remark. Let f € K[X] be as in Definition 3. One sees just as in the example that, 
more generally, If f has degree n, the Galois group of f over K is isomorphic to a 
subgroup of the symmetric group S,. But it can certainly be a proper subgroup; for 
instance, the group of the equation ¥4— 2 = 0 over Q is of order 8 (whereas S4 
has order 24). To prove this one must show that the splitting field E = Q(/2,i ) 
of ¥* —2 over Q (Chapter 6, example (a) after F4) has degree EF :Q = 8. Setting 
F =Q(¥2), one sees first that F:Q =4, since ¥4—2 is irreducible by Eisenstein’s 
criterion (Chapter 5, F10). At the same time E: F = F(i): F <2; since F CR we 
have i € F, so E: F =2. From the degree formula we then get E':Q = 8. 


F7. Let G be the Galois group of a separable polynomial f € K[X] over K, and let 
N be the set of roots of f in a splitting field E of f over K. There is equivalence 
between: 


(i) f is irreducible. 
(ii) G acts transitively on N, that is, for any a, B € N there exists o € G taking a 


to B. 


Proof. (i) = (ii): Any two roots a, B € N of the irreducible polynomial f have the 
same minimal polynomial, and so are conjugate over K (Chapter 7, F1); in other 
words — letting C be an algebraic closure of E —some t € G(C/K) maps a to B. 
Since E/K is normal, t restricts to some o € G(E/K) = G such that ow = B. 
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(ii) => (i): Fix a € N and take g = MiPox (qa). Certainly g divides f. Given B ¢ N 
there exists by assumption ao € G with ow = f. But since g(a) = 0 we must have 
g(B) = g(oa) = 0. Every root of f is thus also a root of g. Since f is separable, 
this means f divides g. 


Definition 4. Let f ¢ K[X] be normalized of degree n > 1, and let E be a splitting 
field of f over K. In EX] we have 


f(x) =] [%-a), 


i=1 


where 01,...,@, are not necessarily distinct. Now consider the element 
(16) A= A(ay,...,Qn) =| [ @-a) 
i<j 


of E. Its square 


(7) D=D(f):=A7=[[@-a)=Cp"?? [] @-a) 


i<j i#i 


is obviously independent of the order in which we take a1,...,Q@n. We call D(f) 
the discriminant of the polynomial f € K[X]; it is nonzero if and only if f is 
separable. 


By the foregoing, oD = D for every o € G(E/K). Galois theory then gives 
(18) D(fyek, 


since in the inseparable case we have D(f) = 0, which implies (18) trivially. 

Furthermore, /D = Ac E by (16), so K (/D) C E. When is VD not actually 
in K, that is, when is K(/D): K = 2? This is an interesting question; here we will 
treat only the following special case, whose proof affords us a nice opportunity to 
practice our Galois-theoretical skills. 


F8. Suppose the Galois group G of an irreducible and separable polynomial f € 
K[X] over K is cyclic of even order n. So long as char K # 2, the discriminant D 
of f is not a square in K,so /D € K. 


Proof. Saying that G is cyclic is saying that any element of G can be written as a 
power o/ of a fixed o € G. Let a be a root of f in the splitting field E of f over 
K. Since G is abelian, all its subgroups are normal, so K(a)/K is normal (see F4). 
Since f is irreducible, this implies that all roots of f already lie in K(q), that is, 
E = K(qa). Thus f has precisely distinct roots 1,...,@, in E, which we can 


assume to be numbered as follows: a) =a, @2 =0d, a3 =07a,...,@, =o" !a. 


Then A =]];<;(@j—o1) =i <jcjen(07'a—0* a) = Jo cjcjen—1 (0/0 — 0a). 
It follows that F 
oA _ Vi=.@-«) 


= (-1)""' =-1, 
mise” 
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since we’ve assumed n even. Thus the element A = / D of E is not G-invariant, 
and so does not lie in K. 


Perhaps it might have been conceptually clearer to point out the relation oA = 
sgn(o) A, where sgn(o) is the sign or parity of o regarded as a permutation of S, 
(see also Problem 15.9 in the Appendix). 


Remark. Keeping the preceding notations and the assumptions of F8, the field 
K(VD) is the unique intermediate field F of E/K such that F : K = 2. You should 
now, at the end of this chapter, persuade yourself of this fact, as a consequence of 
the Fundamental theorem of Galois theory, a later result on cyclic groups (F6 in 
Chapter 9), and of course F8. 
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Finite Fields, Cyclic Groups and Roots of Unity 


1. One infinite family of finite fields has already come to our attention (Section 3.5): 
namely, for every prime number p, there is a field F, := Z/p with p elements. It 
was Galois who first discovered that there are other finite fields out there. In this 
chapter we will put together a list, so to speak, of all finite fields, and discuss some 
key properties of these fields. 


Let K be a given finite field and let k be its prime field (Section 3.5). Since & is finite 
one cannot have k ~ Q, so k ~ Fy, where p, a prime number, is the characteristic 
of K. We look at K as a vector space over k, necessarily of finite dimension. 
Setting d := K:k we then get an isomorphism of k-vector spaces K ~ k@. Thus 
the number of elements of K is 
q:=|K| = p*. 

The multiplicative group K* of K has order 

|K*|=q—-1= p11. 
We claim that a7~! = 1 for all ~w € K*. This comes from a more general fact: 


F1. If G is a finite group or order n, then x" = 1 for every x €G. 


Proof. We prove the assertion here only in the case that G is abelian (see also 
page 95). Suppose G = {a1,...,@n}, and consider g := a1d2...dy. For a given 
x €G, the map a; > xa; is a permutation of G, so g = (xa1)(Xa2)...(Xan) = 
x"d\da2...Ayn = x" g. Therefore x” = 1. 


Thus any element w # 0 of a q-element field K satisfies the equation a7! = 1, 
hence also the equation aw? = a. The latter is also satisfied by a = 0. Therefore 
every element of K is a root of X¥% — X, and by looking at degrees we see that 


(1) x4-xX =| [(¥-a). 
ack 


In particular, K is a splitting field of X% — X over Fy, and as such is uniquely 
determined up to isomorphism. So all g-element fields are isomorphic. 
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How about the existence of a finite field of p” elements (for a given prime p and 
natural number 1)? The preceding discussion shows how one must proceed: For 
q := p”, consider the polynomial 


(2) fOQ=XI-X 


over F,. Let K be a splitting field of f over F, (whose existence is guaranteed 
by Chapter 6, F4). The next two assertions then imply that K has exactly g = p” 
elements: 


(i) K contains only roots of f. 

(ii) f has no multiple roots. 
Proof of the assertions. (i) Let K’ = {a € K | a4 = a} be the set of roots of f 
(in K). By definition, K’ is the fixed field of the automorphism og : a + a of 
K (Chapter 7, Remark after F14). Hence K’ is a field. It contains F,, since F, is 
the prime field of K. Therefore K’ is a splitting field of f over F,, and because 
K'C K we have K’= K. 
(ii) The derivative of the polynomial in (2) is f’(X¥) = g¥7-!—1=-1. By 
Chapter 7, F9, f is in fact separable. (Another proof: If w 4 0 is a root of f, set 
XU 1-1 Xt 1 ge! 

X-a X—a 


thus g(a) = (¢—l)a?* £0.) 


To summarize the work so far: 


gi= = X74 0NTI 4... +00? 


Theorem 1. Let C be a (fixed) algebraic closure of Fy. For everyn € N, there is in 
C exactly one finite subfield Fyn having p” elements, namely the splitting field of the 
polynomial X?" — X over FE, in C. The elements of Fyn are precisely all the roots of 
X?" _ X. Every finite field (of characteristic p) is isomorphic to one and only one 


Fn. 


Let K be a finite field with ¢g = p” elements. Then K ~ Fg. If £ is an extension 
of K such that E': K =m, we have |E|=q”™. As we saw above, E is then a splitting 
field of ¥9"" — X over K. Therefore, up to a K-isomorphism, K has at most one 
extension of degree m over K (Chapter 6, F4). Conversely, Fgm is an extension 
of F,, for any me N. Therefore K ~ Fg clearly has an extension E such that 
E = Fgm. Because |E| = |K|#'* we then have E: K = m. Putting it all together: 


Theorem 1’. /f K is a finite field andm €N, there is an extension E of K of degree 
m, and it is unique up to K-isomorphism. 


What can one say about the structure of the multiplicative group K* of a finite 
field K? We shall see that the answer is as simple as can be: K” is cyclic. Before 
we prove this fundamental theorem, however, we will indulge in a little detour on 
cyclic groups. 
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2. First let G be any group and a an element of G. There is a well defined group 
homomorphism 


(3) yg:Z—>G, determined by lhe a. 


Let (a) := {a | m € Z} denote its image. By the Fundamental Homomorphism 
Theorem, (qa) is isomorphic to Z/kergy. But kerg, being a subgroup of Z, is 
actually an ideal of the ring Z. Since Z is a principal ideal domain, ker g has the 
form 


(4) kerg =nZ, with n € N U {0} uniquely determined. 


Definition. If n 4 0 in (4), we call n the order of a, and write n = ord a. If n = 0, 
we set ord a = oo. Since (a) ~ Z/ker gy, we see that a has finite order if and only 
if the subgroup (a) of G generated by oa is finite. 


F2. Let G be a group and a and element of G. Then 

(5) ord a = |(a)|. 

If a has finite order andm € Z, we have 

(6) a” =1 <> orda|m; 

in particular, ord @ is the smallest natural number m such that a” = 1. 


Proof. Suppose n := ord a < oo. Then (a) ~ Z/nZ, but of course |Z/nZ| = n. 
This proves (5). To prove (6), write 


a" =1 —> mekerg = menZ = alm. 


Definition. A group G is called cyclic if there exists y € G such that G = (y); such 
ay is called a generator of G. 


F3. A group G is cyclic if and only if G ~ Z/nZ for somen €N U {0}. 


Proof. If G = (y), the map ¢ in (3), with a = y, is surjective. Thus G ~ Z/ ker g = 
Z/nZ for some n € N U {0}. The converse is clear, since the residue class of 1 in 
Z/n is a generator of the group Z/n (additively written). 


If G is finite, any w € G generates a finite group and therefore has finite order. 
More precisely: 


F4. If G is a finite group and a is any element of it, the order of a divides that of G. 


Proof. If n =|G| we have a” = 1 for any a € G; see FI. The assertion then follows 
from (6) in F2. 


Remark. If @ is an element of a finite group G, we get from (5): 


orda=|G| < G = (a). 
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FS. If G is a cyclic group, so is any subgroup H of G. 


Proof. Suppose G = (y), so the homomorphism (3) is surjective, where a = y. 
The inverse image y~!(H) of H under ¢ is a subgroup of Z, necessarily of the 
form y~!(H) = kZ. Since 9 is onto, we get H = g(kZ) = (y*), that is, y* 
generates HH. 


F6. Let G be a finite, cyclic group of order n. The map 
(7) Ht |H| 


is a bijection between the set of subgroups of G and the set of natural numbers that 
divide n. 


Proof. Suppose G = (y), for a fixed generator y. If d is a natural number and d |n, 
set d’ =n/d. The order of the subgroup H(d) := (y) coincides with ord(y@). 
Because (y@’)4 = y” = 1 we have ord(y@’) |d, by (6); thus |H(d)| divides d. On 
the other hand, (y@’)!#“@I = 1 by FI, and hence n = ord y divides d’ -|H(d)|; 
since n = dd’ we conclude that | H(d)| is divisible by d. Putting it all together we 
get 


(8) |H(d)|=d. 


We claim that d +> H(d) is the inverse map to (7). Let H be any subgroup of G, 
having order d. We must show that d divides n and that 


(9) H=H(d). 


By F5, H is of the form H = (y’). Since (y”)” = 1 it is indeed the case that 
d =|H|=ord(y”) divides n. By F1 we get (y’””)“ = 1; again from (6) there follows 
n|md, which is to say d’|m. Hence y” € (y@’) = H(d), so that H C H(d). Since 
both groups have the same order d, (9) follows. 


3. We are now ready for the result promised earlier: 


Theorem 2. The multiplicative group K™ of a finite field K is cyclic. (A generator 
of K™ is called a primitive root.) 


In fact we prove something more general: 
Theorem 2’. Let K be any field. Then any finite subgroup G of K* is cyclic. 
The proof relies on the following characterization of cyclic groups: 
Lemma. A finite group G of order n is cyclic if 
(10) |{xeG|x4=1}|<d for every d|n. 


Proof of Theorem 2' assuming the Lemma. Set n := |G|. For any d € N there are in 
K at most d elements x with x4 = 1, since that’s the most roots that ¥4—1 € K[X] 
might have in K. Thus (10) is satisfied, which means G is cyclic. 
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Proof of the Lemma. For d |n, set 


Wa(d):= | {x €Glordx= d}|. 
Then 
(11) Yo ved) =n = |G, 


d\n 

since any a € G has a well defined order d, with d|n. Form €N, set 
(12) y(m) := the number of elements of Z/m that are generators. 
Then g(m) = 1. We will show that 

(13) We(d) < (da) forall d|n. 


For wg(d) = 0 this is clear. Thus suppose we(d) = 1; this means there exists 
a € G with ord a = d. Then H = (a) is a subgroup of order d in G, and x4 = 1 
for any x € H. From (10) it follows, in particular, that any order-d element in G 
already lies in H. Because H ~ Z/d, we then get wg(d) = Wy (d) = g(d), which 
yields (13). 


We remark that in the case of the group Z/n we must have 


(14) Vzjn(d) = (4), 


since all order-d elements of Z/n lie in one and the same order-d cyclic subgroup 
of Z/n (see F6 and F5). By summation we get from (13), in conjunction with (11) 
and (14), 

n=) Weld) <>) ed) = do Wan(d) = 2. 


d|n d\n d\n 


But in view of (13) this can only happen if wg (d) = ¢(d) for all d|n. In particular, 
wa(n) = g(n) = 1, so G possesses an element of order n. 


Remarks. Let the quotient map Z > Z/n be written k + k. Then 


n 


Proof: We have 
n n 


Ce a 


so j := ord k divides the right-hand side of (15). Next, 7k = 0, so n| jk. But this 
implies that n/(k,n) divides /. 


-1=0, 


If G is a group and a € G has finite order n, we have for every k € Z 


n 


ky _ 
(16) ord(a") = ey 
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To prove this observe that (a) ~ Z/n and apply (15). 
According to (16) we have ord(a*) = n if and only (k,n) = 1, that is, when k 
and v are relatively prime. Thus the function g defined in (12) satisfies 


(17) y(n) = |{k EeN|1<k <n with (k,n) = 13]. 


For this reason the function @g is of interest in number theory; it is called Euler’s 
totient function or g-function. Now note that 


(nk) =1 => ordk =n => Z/n=(k) => Le (k) > BW: =k Sik. 


Therefore (n,k) = 1 if and only if k is a unit in the ring Z/nZ. In particular, the 
group of units (Z/nZ)* of Z/nZ has exactly y(n) elements: 


(18) y(n) = |(Z/n)"|. 


We call (Z/n)* the group of prime residue classes modulo n. The function @ is 
multiplicative in the following sense: 


(19) p(nin2) = (m1) (nz) if (141,02) = 1. 
To see this, consider the natural ring homomorphism 
Z/[nynz > Z/nyxZ/no. 


Because n; and m2 are assumed relatively prime this map is injective; since both 
domain and counterdomain have n;n2 elements, it must be an isomorphism. But a 
ring isomorphism implies an isomorphism of the corresponding groups of units, in 
this case (Z/njn2)* ~ (Z/n1)* x (Z/n2)*. Keeping (18) in mind, we get (19). 


4. We now talk a bit about the third item in the title of this chapter. 


Definition. If K is a field, denote by W(K) the set of all elements of finite order 
in the group K~. These elements are called roots of unity. For n € N, set 


Wi(K) = {Fe K|o" = 1}. 


The elements of W,,(K) are the -th roots of unity of K. Clearly W(K) and W,,(K) 
are subgroups of K*. An element € € W(K) is a primitive n-th root of unity if 
ord €=n. 


F7. For any n, W,(K) is a finite cyclic group whose order divides n. If K has 
characteristic p > 0, then Wap(K) = Wi (K). 


Proof. W,(K) is finite because its elements are the roots of the polynomial X” —1 in 
K. Being a finite subgroup of K*, however, W,,(K) must be cyclic; see Theorem 2’ 
in Section 9.3. The order of a generator divides n, by F2. 

Now suppose char K = p > 0. Since ¢"? = (¢")? = 1 we have €” = 1 by F14 
in Chapter 7. Thus W,,(K) © W,(K). The opposite inclusion is obvious. 
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Remark. W,,(C) = {e27'*/" | k =0,1,2,...,n—1} has n elements. 


F8. Let C be an algebraically closed field, and taken € N. In case char C = p > 0 
assume also that (n, p) = 1. Then W,(C) has order n. Thus C contains a primitive 
n-th root of unity ¢, and any such is a generator of W,,(C). 


Proof. Consider f(X) = X” — 1 € C[X]. We must show that f has no multiple 
roots; the rest then follows from F7. Now f’(X) = nX"~!; because char K does 
not divide n, it follows that f’(¢) #0 for any €€C%. 


Incidentally, if charC = p > 0 and n is arbitrary, F7 shows that W,(C) has 
order np~“?™), where Wp(n) is defined as on page 40. 


Definition. Let K be a field and take n € N. The splitting field E of X¥” —1 over 
K is called the field of n-th roots of unity over K. We use the notation E = K® = 


K(¥V1). 


F9, Let E = K(¥/1) be the field of n-th roots of unity over K. Then E/K is a (finite) 
Galois extension, with abelian Galois group G. If n is not divisible by char K, then 
G is canonically isomorphic to a subgroup of (Z/n)*. 


Proof. Suppose n is not divisible by char K. The polynomial f(X) = X” — 1 is 
then separable over K, as seen in the proof of F8. Hence F is a splitting field of 
a separable polynomial over K, and E/K is a Galois extension by Chapter 8, F2. 
(That this is still the case for arbitrary n follows from F7.) 

Let ¢, € E be a primitive n-th root of unity, so ord ¢, =n. Given o € G, the 
image o¢, also has order n, so 


(20) olny =C* with (k,n) =1, 


where & can be uniquely determined by the condition 1 < k <n. Then any ¢ = ti 
in W,,(E) satisfies ¢ = ¢*, so k in (20) is independent of the choice of ¢,. In this 
way we get a well defined map 


(21) G—>(Z/n)*, owk. 


It is easy to see that this is a group homomorphism. It is also injective, since o 
is determined by of, —recall that E = K(¢,). Now G, being isomorphic to a 
subgroup of the abelian group Z/n, is abelian as well. 


As can be seen already from K = R or C, the map (21) is generally not surjective. 
(For more interesting examples see F11 in Section 9.5.) But in the case of K = Q 
we have: 


Theorem 3 (Gauss). Let E = Q(/1) be the field of n-th roots of unity over Q. 
The Galois group of E'/Q is canonically isomorphic to the group of prime residue 
classes (Z/n)*. In particular, Q(/1):Q = y(n). 
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Proof. Let € be a primitive n-th root of unity in E'. In view of the homomorphism 
(21), we must show that for each k € N relatively prime to n, there exists 0 € 
G(E/K) with of = ¢*. By F7 in Chapter 8, this boils down to showing that C* is 
also a root of f := MiPog(6): 


(22) ie 0: 


(Group-theoretically, ¢* with (k,n) = 1 is certainly specifiable as the image of an 
automorphism of the group (¢); however, to derive the existence of a corresponding 
field automorphism of Q(¢), we must ensure that C¢* satisfies the same defining 
equation as ¢, that is, (22) must hold.) By expressing k as a product of prime 
numbers, one sees that it suffices to prove the assertion for the case of k = pa 
prime, where furthermore p does not divide n. In Q[X] we have the decomposition 


(23) MEV F(X) a(x). 


Then Gauss’s Lemma (Chapter 5, F7) implies that f(X), g(X) € Z[X]. Now we 
assume, contrary to claim, that f(¢?) 40. By (23) we have g(¢”) =0. Therefore ¢ 
is a root of the polynomial g(X”), and so we get, again by using Gauss’s Lemma, 
a decomposition 


2(X”) = f(X)A(X) with A(X) € Z[X]. 


Passing to polynomials over F, = Z/p via the natural map Z[X] — (Z/p)[X], we 
get the decomposition 


(24) a(X") = f(X)A(X) in FLX]. 


But for any polynomial Z(X) = )° aX‘ over Fy we have F(X)? = (> aX")? = 
ye aP X'P = >0a;X?! = F(X”). Thus (24) can be written in the form 


a(X)? = f(X)A(X) in FLX]. 


In an algebraic closure of F,, therefore, ra and g must have a common root, so 
X"—1= f(X)zZ(X) cannot be separable. This contradicts F8, because (p,n) = 1. 


In the sequel let C be an algebraically closed extension of @ (for instance C=C, 
or the field of algebraic numbers in C). Set W, = W,(C). Then 


(25) x"-1= [[ (x-6. 
feWn 
Definition. The polynomial 
F,(X)= |] (x-9. 
ord €=n 


where the product is taken over all primitive n-th roots of unity of C, is called the 
n-th cyclotomic polynomial. 
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F10. The n-th cyclotomic polynomial Fy has the following properties: 
(a) Fy, is normalized. 
(b) deg Fy, = g(n). 
(c) X"-1l= Tain Fq(X). 
(d) Fy, (X) € Z[X]. 
Proof. Parts (a) and (b) are immediate. Since any ¢ € W,, has a well defined order 


d with d|n, part (c) follows from (25). Part (d) can be proved by induction on n: 
Part (c) implies that 


F,(X) = orl Fq(X), 
d<n 


and in Z[X] one can always divide with remainder by normalized polynomials. 
Another proof of (d): Any o in the Galois group G of Q(+/1)/Q satisfies 


Fr(X) = Tora gan(¥ — 0 (O)) = F(X), 


so all coefficients of F,(X) lie in the fixed field of G, and so, by Galois theory, also 
in @. But once we know that F, € QLX], it follows that F,,, being a normalized 
factor of X” — 1, must lie in Z[X] (Chapter 5, F7). 


Theorem 3’ (Gauss). The n-th cyclotomic polynomial F,(X) is irreducible in Q[X]. 


Proof. Let ¢ be a primitive n-th root of unity. We must show that 
F(X) = MiPog (f), 


since we already know that F,,(X) is a normalized polynomial in Q[X] vanishing 
at ¢. By Theorem 3, Q(¢):@ = y(n) = deg Fy, which is all is needed. 


Remark. Let p be a prime number. Then 


ae -1 
(26) F(X) = Fa lt Xt x? 


and more generally 


m 


xe" _] 
xen _} 


m—1 
’ 


(27) Fym(X) = = 14 xP" 4... 4 XC-DP 


in particular, then, 
(28) g(p™) = (p—1) p™" = p™— p™ |. 


The irreducibility of (27) had already been proved in Chapter 5, F13 using a different 
method (yet one based on the same principles in a way). Of course (28) is also easily 
derived from (17). And one may observe that granting the validity of Theorem 3’ 
the assertion of Theorem 3 can be derived immediately. 
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Ifn = Die DS ote pe is the prime factorization of a natural number n > 1, it 
follows from (19) together with (28) that 


(29) on) =T] i-dpi. 
i=1 


5. We now return once again to finite fields, considering them from the viewpoint 
of Galois theory. Though simple, the relation here is of greatest significance for 
deeper arithmetic questions. 


Theorem 4. Let E'/K be any extension of finite fields and let q be the number of 
elements of K. Then E/K is Galois with a cyclic Galois group, and in fact G(E/K) 
is generated by the automorphism og : a — a4 of E. (We call og the Frobenius 
automorphism, or simply the Frobenius, of E/K.) 


Proof. Let G be the subgroup of Aut E generated by o,. Then K is the fixed field of 
G in E, since by Section 9.1 K contains exactly those a € E such that a7 =a. By 
Galois theory (Chapter 8, Theorem 4), £/K is a Galois extension and G = G(E/K) 
is its Galois group. 


We could also have proved this result without depending of Chapter 8; by and 
large, Galois theory results for finite fields can be verified directly without much 
trouble. 


F11. Let K be a finite field with q elements. Given a natural number n relatively 
prime to q, denote by q mod n the residue class of q in (Z/n)*. Then 


K(V1):K = ord(g mod n). 


Proof. Set E = K(*/1) and let ¢ be a primitive n-th root of unity in E. Theorem 
4 shows that EF: K equals the order of og € G(E/K). Thus, since E = K(¢), the 
degree E: K is the smallest natural number / such that ca! = ¢; that is, gf = 
1 mod n. In other words, E: K is the order of g mod nv in (Z/n)”. 


For practice, derive F11 directly from Theorems | and 2, without appealing to 
Theorem 4. 
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Group Actions 


According to the Fundamental Theorem of Galois theory, the intermediate fields of 
a finite Galois extension £/K are in one-to-one correspondence with the subgroups 
of the Galois group of E'/K. This by itself would be reason enough to study groups, 
and thus it is time for us to turn our attention to some key notions of group theory. 
Natural examples of groups usually come up in mathematics as automorphism 
groups of certain structures. Our investigations in Chapter 8 illustrate this typical 
trend (among others). Think also of concepts such as the linear group of a vector 
space, the orthogonal group of a quadratic form, etc.; or even, if you will, the group 
of bijections of a nonempty set !/ — where the structure in question is a bare set. 
In the sequel, G will denote a group and M a nonempty set. 


Definition 1. We say that G acts (or operates) on M if there exists a group homo- 
morphism 
T:G—>S(M) 
ot T(o)=T, 


from G into the group of permutations of M. The result of applying o € G to 
x € M is denoted also by 
0.x =ox =T,(x). 


Thus one gets a map 


1 GxM—>M 
(1) (0,x) Rox 
satisfying 
(i) lx =x and (ii) (ot)x =o(tx). 


Conversely: Given a map (1) satisfying (i) and (ii), the group G acts on M by 
means of the map 7’: G + S(M) defined by T(o)(x) = ox. Indeed, (ii) says that 
T(ot) = T(c) 0 T(z), and (i) says that T(1) = idy, so T(a) 0 T(o~!) = TI) = 
idy = T(o~!)o T(c). Thus we really do have T(o) € S(M), with T(o)! = 
T(o—!). Either the map G > S(M) or the map G x M — M can be called the 
corresponding action of G on M. 
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Example 1. Let E/K be a Galois extension with Galois group G = G(E/K). The 
group G acts on E via (o, x)» a(x), and it acts on E* likewise. If E is a splitting 
field of a polynomial f € K[X] having degree n and roots a1,...,Qn, the group G 
also acts on M = {a,...,Qy}. 


There are many other examples of group actions, arising in very diverse ways. 
It is a group’s nature, so to speak, to act on something. 


Definition 2. Let G act on M. For x € M, the set 
Gx ={ox|o€G} 
is called the orbit of x (relative to the given group action). Compare Chapter 8, F1. 


F1. Let G act on M. Any two distinct orbits (relative to the given group action) are 
disjoint, and M is the union of all such orbits. 


Proof. For x, y €¢ M, write x ~ y if there is o € G such that ox = y. You can easily 
persuade yourself that ~ defines an equivalence relation on M. The equivalence 
class of x € M under ~ is none other than the orbit Gx of x. This is enough to 
prove the assertion. 


Definition 3. Let G act on M. The action is transitive if it has a single orbit; i.e., 
if, for some (or for that matter, for any) x ¢ M, 


M =Gx. 
See also Chapter 8, F7. 


F1’. Let the group G act ona finite set M. Then 
IM|=SCICl, 
Cc 


where the sum is over all distinct orbits C. 


Remark. The cardinality |Gx| is called the size (or length) of the orbit of x (under 
G). 


Example 2. Let G be a group. Then G acts on M := G via the action (0, T) > oT. 
The corresponding homomorphism T : G > S(G) associates to every o € G the left 
translation Ts corresponding to o, defined by 7,(t) = ot. Clearly, T is injective. 
If |G| =n we have S(G) ~ Sy, so we obtain: 


F2 (Cayley’s Theorem). Every group G of order n is isomorphic to a subgroup of 
the symmetric group Sy. 
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Example 3. Let H be a subgroup of a group G. Then H acts on M = G by left 
translations. The orbit of o € G is 


Ho ={po|pe HA}. 


This is called the (right) coset of o mod H. The set of such right cosets is denoted 
by H\G, and its cardinality by 


G:H=|H\Gl. 


If G is finite, so are H and H\G. Since |Ho| =|A| for all o € G, we see that FI’ 
implies the next result: 


F3 (Euler-Lagrange). For any subgroup H of a finite group G, 
(2) |G| = (G:H)-|H]. 
In particular, the order of H divides that of G. 


Remarks. (a) Of course (2) is also valid for infinite groups, if regarded as an 
equation between cardinals. 


(b) We denote the subgroup {1} by 1. Then |H| = H:1 for any H, and moreover 
(2') G:1=(G:A)(H:1). 


(c) We complete here the proof of Chapter 9, F1, picking up the general (nonabelian) 
case. Given a € G we look at H = (a). Since |H| = ord a, this number divides 
n = |G|, according to F3. In particular, a” = 1. 


(d) For H a given subgroup of G, write 
G/H 


for the set of left cosets oH, for o € G. Then G is the disjoint union of all the 
distinct left cosets mod H. There are as many left cosets mod H as right cosets 
mod H; thus 

|IG/H|=G:H. 


To see this, note that (o,t) ++ ta! also defines an action of H on M = G, which 
associates to each o € HA the right translation by o~!. The corresponding orbits 
are precisely the sets tH. The map o +> 0! of G onto itself yields a bijection 
H\G > G/H, since (Ho)! =o7!H"!=o7'H. 


We call G: H the index of H in G. If G is finite, 
(2”) G:H=|G|:|H| 


by (2); in particular; G: H then divides |G]. 
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Definition 4. Let G act on M and take x € M. The subgroup 

G, ={0€G|ox=x} 
is called the stabilizer of x. 
F4. Let G act on M and take x € M. The map 

G—> Gx, or ox 

defines a bijectioni : G/Gy > Gx. Thus, if Gx is finite, so is G: Gx, and 
(3) |Gx| = G:G,. 
If G is finite, so is Gx, and 


IG]. 


(4) ge lel 
bal area 


in particular then the size of any orbit divides the order of G. 


Proof. Take H := Gx. Then oH = TH if and only if p:=07!t € H. It follows that 
TX =Opx =ox. Thus we get a well defined map i: G/G,, > Gx with i(oH) =ox. 
Clearly i is surjective. But 7 is also injective since the equality ox = tx implies 


o~!tx =x, hence o~!t € H and also oH = tH. 


F5 (Orbit formula). Let the group G act on a finite set M, and let x;,...,Xs be 
representatives for each of the s distinct orbits of the action. Then 


|M| =) 1 (G:Gx,). 
i=1 


Proof. By Fl’, we have |M| = >~;_, |Gx;|. The assertion follows thanks to (3). 


i=1 


Example 4. Let G be a group. As can easily be checked, G acts on M = G via 
the map (o,t) tb ota—!. Let o + Ty be the corresponding homomorphism from 
G into S(G), that is, T,(t) = ata—!. Now, 


To (t%)T2) = o(tt2)o! =oyo long != To (t1)To (2); 
therefore 7, is actually an automorphism of G. Thus we obtain a homomorphism 
T:G—>AuG 


from G into the automorphism group of the group G. Elements of S(G) in the 
image of T are called inner automorphisms of G. The kernel of 7 is the normal 
subgroup of G given by 


ZG :={a| oto != 1 forallte G}; 
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it is called the center of G, and of course can also be characterized as 
ZG = {o |ot = to for all t € G}. 
The stabilizer G; of t with respect to the action under consideration is the subgroup 
(5) Zg(t):= {o | oto! = 7}, 
called the centralizer of t in G. The orbit of t € G, that is, the set 
{oto !|o€G}, 


is called the conjugacy class of t in G. Two elements t and tz in the same con- 
jugacy class are conjugate in G; this happens if and only if tz is the image of tT 
under an inner automorphism: t2 = oto! for some o €G. 


We now apply F5 to the action from the preceding example. We obtain: 


F6 (Class formula). If G is a finite (nonabelian) group and 1, T2,...,T, represent 
each of the conjugacy classes of G that contain more than one element, we have 


|G|=|ZG| + )> (G:Ze(a)). 


i=1 


Proof. The conjugacy class of t € G has a single element if and only if t € ZG. 


Definition 5. Let p be a prime. A finite group G is called a p-group if |G| is a 
power of p. 


F7. Every finite p-group G # | has nontrivial center. 
Proof. By F6, p must divide |ZG|, so ZG # 1. 


F8. If G is a p-group of order p™, there exists a chain 
G=H) >A, > A)2D::-2> An=1 


of normal subgroups H; of G such that 
Aj-;:Hj =p forall \<i<m. 


Proof. Let m > 0. By F7, ZG contains an element a # 1. The cyclic subgroup 
(a) has a subgroup H of order p (see Chapter 9, F6). Since H C ZG, this H 
is normal in G. Now consider the quotient group G = G/H. By induction we 
can assume that there exists a chain G = No DT Nj D--- D Nm—1 = 1 of normal 
subgroups N; of G such that N;-;:N; = p. Each map y; : G > G > G/N; gives 
rise to an isomorphism G/ker yj ~ G/N;. Now set Hm = 1 and H; := ker g; for 
0<i<m-—1. We then get achain G = Hyp D A, D---D Ayn-1 = HD An =1 
of normal subgroups of G. For 0 <i <m-—1 we have, by the definition of H;, the 
equalities G: H; = G: N; = p’, the latter because of Euler-Lagrange, equation (2). 
Again from (2) we then get H;_;: H; = p. From our choice of H this is also true 
for i =m. Thus we have obtained a chain with the desired properties for G. 
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Notation. If H is a subgroup of G we write H <G. If H is a normal subgroup of 
G, we indicate this by writing HIG. 
Example 5. Let G act on M. Then G acts also on the power set of M, via 
(0, X)R aX ={ox|xeX}. 
We have |oX| = |X| for every ¥ CM. 


Example 6. In the preceding example, consider in particular the action of G on 
M =G by inner automorphisms. The orbit of an XY C G is then {0X¥o~!|o € G}. 
We use the notation X° := 07! Xo. If now ¥ = H < G is a subgroup of G, the 
image H° = 0~'Ho is also a subgroup of same cardinality. We say that H® and 
H are conjugate subgroups of G. The stabilizer of H, 


NGH :={0 €G|oHo! = H}, 
is called the normalizer of H in G. By definition, H < Ng H. Moreover, 
(6) HAG = NGH=G. 
By F4, G: NGH is the number of subgroups of G conjugate to H. 


Example 7. Take H < G. Then the action of G on G/H via (0, tH) oTH is 
transitive. The stabilizer of tH is the subgroup tHt~! of G, since for o € G we 
have o(tH) = tH — > oe tH. 


Example 8. In the situation of the preceding example let U < G be another subgroup 
besides H. Then U acts on G/H via (o, tH) otH. The orbit of tH is the set 


(7) {otH |o €U} 
of left cosets mod H; the stabilizer of tH is 
(8) tHt !NU, 
by Example 7. The union of cosets of (7) is the set 
UtH. 
This is called the double coset of t relative to U and H. Clearly, 


(a) G is the disjoint union of all the distinct double cosets relative to U and H. 


If we now assume that G is finite and let m be the size of the orbit (7) of tH, we 
get |UtH| =m-|H|. By virtue of F4, then, m = U : (tHt~! NU); see (8). Thus 
the cardinality of the double coset Ut H is 

U||H 

(9) Cane 

|tHt—! NU | 


Moreover 
|cHc 1 NU|=|Hnt 'Ut| =|, 


as can easily be seen by applying the inner automorphism corresponding to t. Thus: 
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(b) Let G be finite and let T be a complete set of representatives of the distinct 
double cosets of G relative to U and H. Then 


|U| || |U| || 
10 G 
ao lie 2 HAT » Tanoty 


For the rest of this chapter G will denote a finite group and p a prime number. 
Definition 6. (a) A subgroup H < G with |H| = p” is called a p-subgroup of 
G. 

(b) Suppose |G| = p”a with (a, p) = 1, so that p” is the highest power of p 
dividing |G|. A subgroup H < G such that |H| = p” is called a Sylow p- 
subgroup of G. We denote by 

Syl,G 
the set of such subgroups. For an arbitrary subgroup H of G, F3 gives 
(11) HeSyl,G <= > H isa p-group and G: H £0 mod p. 


Example 9. Consider the group G = GL(n, p) = GL(n,F,) of invertible n x n 
matrices with coefficients in the field F,. It is easy to see that G has order 


(12) |G| = (p"—1)(p"— p)...(p" =p"), 


since there are p” — 1 possibilities for what the matrix does to the first element of 
the canonical basis of FF then p” — p for the second, and so on. The highest power 
of p that fits in |G| is thus 


(13) pere) = prev? 


The subgroup 
1 


a ‘ <GL(n, p) 


? 1 


of all upper triangular matrices with | in the diagonal obviously has order (13). 


Thus P is a Sylow p-subgroup of G. 


Lemma. Suppose H < G and let P be a Sylow p-subgroup of G. Then there exists 
t €G such that 
HM P* 


is a Sylow p-subgroup of H. 


Proof. We look at the double coset decomposition of G relative to H and P. By 
(10) we have 


|| |P| 
co ~ Linn PA 4 [HO PF 


100 10 Group Actions 


Dividing this equation by the highest possible power of p, namely p” = | P|, we 
see that for at least one t we must have 
|H| 


1 _ = WH: HM P* £0 mod p. 
[Hn Pt] mee 4 


On the other hand, HM P* is a p-group, since it is a subgroup of P*. Thus, by 
(11), HN P* is a Sylow p-subgroup of H. 


Theorem 1 (Sylow’s Theorems). 

First. G contains a Sylow p-subgroup. Every p-subgroup of G is contained in some 
Sylow p-subgroup of G. 

Second. Any two Sylow p-subgroups of G are conjugate. 


Third. Let np be the number of Sylow p-subgroups of G. Then 
(a) Np divides G: P for P € Syl,G, and 
(b) mp = 1 mod p. 


Remarks. The number vy, actually satisfies 
(a’) Np = G: NG P, and also 
(b’) np = 1 mod p@ for every d such that 


(15) P: POP’ =0 mod at for every P’ € Syl,G distinct from P. 


Now (b) will follow from (b’), since (15) is obviously satisfied for d = 1. 


As a consequence of Sylow’s second theorem, a normal Sylow p-subgroup of G is 
the only Sylow p-subgroup of G. The converse is clear: If G only has one Sylow 
p-subgroup, this group is normal in G, since any conjugate of a Sylow p-subgroup 
of G is also a Sylow p-subgroup. 


Proof of Sylow’s Theorems. (1) Let n = |G|. By F2, G is isomorphic to a subgroup 
of S,. By associating to each x € S, the permutation matrix P,; € GL(n, F,) that 
accounts for the effect of 2 on the canonical basis ¢1,..., én (that is, Pe; = ex(i)), 
we get an injective homomorphism of S, into GL(n, F,). Thus G is isomorphic to a 
subgroup H of GL(n, F,). But for GL(n, F,) we have produced a Sylow p-subgroup 
in Example 9. By the Lemma on the previous page, this means that H, too, has a 
Sylow p-subgroup, and thus so does G. 


(2) Take P € Syl,G and let H < G be any p-subgroup of G. By the Lemma, there 
exists tT € G such that HM P* € Syl, H. But H is also a p-group, so HN P* = H, 
and therefore H C P* € Syl,G. This completes the proof of Sylow’s first theorem. 
Now, if H € Syl,G, it follows that H = P*, since both groups have same order. 
This proves Sylow’s second theorem. 


(3) Take P € Syl,G. Sylow’s second theorem yields np = ee [rE G}| =G:NGP 
(see Example 6). This proves (a’), hence also (a). 
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(4) There remains to prove (b’). To do this consider equation (14) with H = Ng P. 
Then 
PtNGP, forte T, 


runs over all the distinct double cosets of G relative to P and Ng P. We may as 
well assume that 1 ¢ 7. For t # 1 in 7, then, we have t € NGP, so P* F P. 
Being a p-group, P’M Nc P is contained in a Sylow p-subgroup of Ng P, thanks 
to Sylow’s first theorem. But then, being normal in Ng P, the group P must be 
the unique Sylow p-subgroup of NG P, so that P?’M NgP C P. It follows that 
P™*(1 NGP GC P™t 1 P, and hence 


P’ONGP=P'OP. 
Dividing (14) by |H| = |Ng P| we get 


G:NgP=1+ >> (P:P'NP). 
teET \ {1} 


Using (a’) it now follows that np = 1 mod p@ for any d that obeys (15). 


Remarks. Together with F8, Sylow’s first theorem immediately implies that for any 
prime power p* dividing the order of G there exists a subgroup H of G such that 
|H| = p*. For k = 1, in particular, we get a theorem that goes back to Cauchy: For 
any prime p dividing the order of G, there is an element of order p in G. 

The Norwegian mathematician Ludwig Sylow (1832-1918) recognized the 
significance of Cauchy’s Theorem and fleshed it out into the three statements that 
bear his name, which have played, ever since their publication (in 1872 in the 
Mathematische Annalen), a fundamental role in group theory. 

If a finite group G is abelian (or just nilpotent — see §10.19 in the Appendix), 
it is actually the case that there is a subgroup of any order dividing the order of G. 
But not so for general groups; already the alternating group A, has no subgroup of 
order 6 (see §15.11 in the Appendix). 
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Applications of Galois Theory to Cyclotomic Fields 


1. We start by considering, in the light of Galois theory, the general problem of 
constructibility with ruler and compass, which has served as our lodestar, so to 
speak, since Chapter 1. 


Theorem 1. Let M € C contain 0 and 1, and set 
K:=Q(MUM). 


Given z € C, there is equivalence between: 
G) z€ AM (that is, z is constructible with ruler and compass); 
(ii) z is algebraic over K, and the degree over K of the normal closure E/K of 
K(z)/K) is a power of 2. 
Proof. The implication (ii) = (i) is especially interesting, and we prove it first. By 
assumption the extension E/K is Galois, and its Galois group G = G(E/K) is a 
2-group. Thus, by F8 in Chapter 10, there exists a chain 


(1) G=H2M2>...2>H,=1 


of subgroups of G (normal, in fact) such that Hj-1:H; = 2. By Galois theory 
(Chapter 8, Theorem 5) this chain of subgroups Hj of G has a corresponding chain 


(1’) K=KoCKC:::-CK,=E 


of intermediate fields K; of E/K, with K;: Kj-1 = H;-,: H; = 2. By Theorem 1’ 
in Chapter 1, this implies that z¢ AM. 
The implication (i) = (ii) just serves to round out the picture; its proof, though 
elementary, is quite instructive. Taking z ¢ AM, we know from Theorem | in 
Chapter | that z lies in a certain extension K,, of K that can be obtained from K 
by successively adjoining square roots: 


ZE€ Ky, = K(u1,U2,...,Wm), 


where w? € K(w1,..., wj—1) and w; € K(w1,..., wi-1) for 1 <i <m. 
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Let Em/K be the normal closure of Km/K. Then E»/K is a finite Galois 
extension. Clearly, to prove our assertion it is enough to show that FE, : K is a power 
of 2. This we do by induction on m. For m= 1 we have K; = K(w1) with w? eK, 
w, € K; in this case K,/K is normal, so E; = K; and hence E,: K = 2. Now 
suppose m > | and set Km—1 := K(w1,...,Wm-—1). By the induction hypothesis, 
the normal closure E,—1/K of Km—,/K has degree E—,:K equal to a power 
of 2. We have Ky = Km—1(Wm). 

Now let a; = Wm, Q2,...,@5 be the distinct conjugates of w,, over K (in 
C). Then a? € Em—1, since a? is conjugate over K to the we € Ky_-1. Now of 
course Ey, = Em—1(Q1,...,@5), SO Em: Em—1 must be a power of 2. But then 
Em: K = (Em: Em-1)(Em-1: K) too is a power of 2. 


2. We now apply Theorem | to the problem of dividing the circle into n parts. As 
we saw in Chapter 1, the question is to decide, for a given n € N, whether or not 


the complex number 
c = e2til n 


lies in the field AQ. The extension Q(¢)/@ is normal; by Theorem 1, then, ¢ <€ AQ 
if and only if @(¢):Q is a power of 2. By Gauss’s Theorem (Chapter 9, Theorem 3) 
we have Q(¢):Q = y(n), so we must figure out for what values of n the natural 
number (7) is a power of 2. Let 


(2) Pe aren (ed 


be the prime factorization of n, where e is nonnegative, the primes p1,..., py are 
odd and pairwise distinct, and each e; is at least 1. Applying g to (2) and taking 
multiplicativity into account (see (19) in Chapter 9), we get 


y(n) = 9(2°)g(p}')... p(pe"). 


Since y(p”) = (p—1) p™! for m= 1, we see that y(n) is a power of 2 if and only 
if all the e;’s are 1 and p; —1 is a power of 2 for every i. This leads to: 


Theorem 2 (Gauss). A regular n-gon is constructible with ruler and compass if and 
only if 
n= 2° pi pr... Pr, 
where e = 0 is arbitrary and p,,..., p; are distinct primes of the form 
pi=1+2?", 
The only thing in this result that has not yet been proved is: 
Lemma. Form €N, the integer 1 + 2” cannot be prime unless m is a power of 2. 


Proof. Suppose m = m,mz2, with mz > 1 odd. Then 


142" =1-—(-2™)"2 = (14+2™)(1-2 + 2712... 4 gm (ma—1)) 


is composite. 
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Remarks. For & € N U {0}, the number 
3) Fy = 142" 


is called the k-th Fermat number. It is easy to show that 


(4) Fo =3, Fy =5, Fp =17, Fy = 257, Fy = 65537 


are prime. Fermat (1601-1665), the great reviver of number theory in the modern 
era (see Winfried Scharlau and Hans Opolka, From Fermat to Minkowski, Springer, 
1985), occupied himself, in a context totally different from circle division, with the 
numbers that now bear his name; see Pierre de Fermat, (Euvres I, 127ff. He stated 
that they are all primes (Zuvres IL, p. 309). But already the next Fermat number, 


Fs = 1+ 237 = 641-6700417, 


is composite, as Euler (1707-1783) found out and showed more or less as follows: 
Because 641 = 5-27+1=54 +24, there is a congruence 


23? = 24.(27)4 = —54.(27)4 = —(5-27)* = —1 mod 641. 


Fermat’s mistake is noteworthy because he himself had found counterexamples of 
the caliber of Euler’s in the case of the Mersenne numbers 2? — 1; but he seems to 
have relied on Frenicle, who apparently agreed with the claim (Fermat, uvres II, 
p. 208). 

Today we know that F, is composite at least for 5 < k < 32, and we don’t 
know whether there are any prime Fermat numbers after those in (4). (Note that 
F\7 already has 39457 decimal digits, and F'32 has over a billion.) 


It is easy to see from the definition (3) that Fy4,—2 = (Ff, —2) Fx; by induction, 


(5) Fm—2= || Fe. 


0<k<m 


Thus n = Fs —2 = 237-1 = FoF, Fo F3F,, is the largest known odd number of 
sides that a constructible regular polygon can have; its decimal representation is 
4294967295. 

Incidentally, (5) implies that any two distinct Fermat numbers are relatively 
prime. Thus there appear infinitely many primes in the factorization of Fermat 
numbers. 


3. Given a prime number p # 2, we now look at the field Q(¢,) of p-th roots of unity 
over @ (where ¢, denotes a p-th root of unity 4 | in C, for example ¢, =e?” /P) and 
ask what square roots of nonzero rational numbers lie in this field. For d,, dy € Q”, 
the extensions Q(./d;) and Q(./d2) coincide if and only if dj = x*d>, for x € Q™. 
Thus we should investigate quadratic subfields of E = Q(¢)), that is, subfields F 
of E such that F:@ = 2. Now, E/Q is a Galois extension, with Galois group 
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G = G(£E/Q) isomorphic to (Z/ p)*; see Chapter 9, Theorem 3. The group (Z/ p)” 
is cyclic of order p—1, by Theorem 2 in Chapter 9. Thus for any factor t of p— 1 
the group G has a unique subgroup of order ¢ (Chapter 9, F6); then by Galois 
theory (Theorem 5 in Chapter 8) E has exactly one subfield F such that F: F = tf. 
Setting ¢ = (p — 1)/2, we conclude that E = Q(¢,) has a unique subfield F with 
F:Q =2. In other words: There exists a unique square-free integer d # | such that 
Vd € Q(%). 


What then is this number d, as a function of p? 


Notation. For a prime number p + 2, set 


eo p for p =1 mod 4, 
(6) P =| for p = 3 mod 4. 


If desired, this can also be written as 
(6') die a a 2 


Theorem 3. For every prime p # 2, the extension Q(./p*) is the unique quadratic 
subfield of Q(fp), and thus p* is the unique square-free integer (apart from 1) such 


that ,/p* € Q(ép). 


Proof. As seen earlier, @(¢,) has a unique subfield F such that F:Q = 2, and 
F = Q(V4d) for a unique square-free d = d(p) € Z. Instead of finding d “naively” 
(but see the remark in the proof of F1 below), we draw on F8 of Chapter 8, which is 
perfect for the situation. According to that result, F = Q(/D), where D = D(f) 
is the discriminant of the p-th cyclotomic polynomial 


(7) f(X)H=14- X44 X72 4-.-4 HR 7, 


Then d(p) is simply the square-free part of D(f). So all we have to do is compute 
the discriminant D(/); the result is 


(8) Df) = (NPY? pP, 


according to the following lemma. Thus, as asserted, 


d(p) = (-1)@-P? p = p*. 


Lemma. For p # 2 prime, the discriminant of the polynomial f = F, is given by (8). 


Proof. For simplicity we write ¢ for ¢). Substituting X = 1 into the polynomial 
f(X) = Ee (X —¢*) and taking (7) into account we get 


p-i 
(9) []a-s)=.. 
k=1 
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By (17) in Section 8.3, we have 
p-l 
(10) Df) =[[C-0) =] ]/'). 
i#i i=1 
where ¢ = (—1)"@—)/2 = (—1)(-)/2,, Since X? —1 = f(X)(X — 1), this gives 
pert = f'(OE-1). 


Now taking the product over all conjugates, and applying (9) and (10), one obtains 
what is needed: 


p?'=eD(f)p. 


We now wish to study more closely the action of the Galois group G on the 
quadratic subfield F of Q(¢) (where = ¢, as above). By Theorem 3 in Chapter 9, 
there is first of all a natural isomorphism 


(1) (Z/p)* > G = G(Q(é)/Q), 


sending each element a mod p of (Z/p)”* to the automorphism og characterized by 
(12) oa(S) = 6". 

For each o € G, then, we have 

(13) a(/p*) = x(a) /P*, 


with a well defined y(o) € {1,—1}. The map x : G > {1, —1} is of course a group 
homomorphism; it is called the sign character (of p). 


Definition. For a € Z such that (a, p) = 1, set 
a 
(14) (5) = x(oa). 


This is called the Legendre symbol of a (relative to p). The element ($) of {1,—1} 
is thus fixed by the equation 


a 
(15) oa(VP*) = (=) VF. 
As already observed, 
ab a\(b 
~ (a) =a) 
By definition, H := ker x is the subgroup of G associated by Galois theory to 


the subfield F = Q(./p*). But since G is cyclic and of order p—1, it has only one 
subgroup of index 2, namely the group 


(17) H={t?|rteG) 
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of all squares in G (Chapter 9, F6). This, in view of the isomorphism (11), implies 
that 


(18) () =1 < > amod p isa square in (Z/p)*. 
We see, then, that the Legendre symbol (14) has an elementary number-theoretic 


description: 


(19) (<) _ 1 if ¥? =a mod p has a solution in Z, 
Dp ~ \-1 if X? =a mod p has no solution in Z. 


(The choice of 1 and —1 instead of 0 and | to label the branches of this dichotomy is 
made so that (16) works.) Since H in (17) is the unique subgroup of order (p—1)/2 
in G is, we obtain Euler’s criterion: 


(20) (5) =1 — > g?-Y? =1 mod Dp. 
Substituting a = —1 (and considering that then, since p 4 2, congruence implies 
equality), we obtain 
ST sa ee 1 for p=1 mod 4, 
@)) (=) ar ~ )-1 for p =3 mod 4. 


We know that 1,¢,...,¢?~? form a basis of the Q-vector spaces Q(¢), since 
the minimal polynomial F, of ¢ over @ has degree p—1. Multiplication by ¢ 
transforms this basis into the basis 


(22) CO elo 


which is ideal for Galois-theoretic considerations, consisting as it does of all the 
Q-conjugates of the one element ¢. Now ./p*, like any other element of @(¢), has 
a unique representation in the form 


(23) Jp* = do ago(f) witha, € Q. 


ocG 


Applying some t € G we get x(t),/p* = >>, aota(¢); this yields a relation 
x(t) a; =a,-1 between coefficients. Thus 


dg =41X(0) foranyo eG. 


Then (23) becomes 


(24) VP* =a: x(a) o(6). 


ocG 


More can be said: 
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F1. With the preceding notation, 
(25) VP*=+)>° x(o)o(6). 
ocG 
Thus, in terms of the Legendre symbol (see (14) and (12)), we have 


p-l 


(25') d (FS) = ev0%. 


a=1 


Proof. We must show that the rational factor a; in (24) satisfies at = 1. There 
seems to be no tidy conceptual argument at hand to let us get around the dirty work. 
So we make a virtue of necessity and determine the square of 


(26) a:= S° x(a)o(6) 


by a direct computation — whereby we incidentally establish again that the quadratic 
irrationality ./p* does belong to Q(¢,). Observing that x(a) = x(o—'), we have 


a =S° xo) x(r)o(O) 1H) = D> xo to (Er), 


OT OT 


Let p=oa7! 


we get 


T, SO T = op. Since a sum over all (0, T) is also a sum over all (0, /), 


a =P x(p)a(Sop(f) = >> x(p) Ya (Ep). 


p,0 p 


We can evaluate the rightmost sum by observing that €o(¢) is a p-th root of unity, 
primitive unless p is the particular automorphism o_; defined by (12). For a p-th 
root of unity 7, 


_ 2 p-1__Jp-l ifn=1, 
Ylon=ntyt+--tn =| ee 
= 1 ifn. 


Applying this to fo(¢), we get 
a? = x(o-1)(p—1)— D> x(p) 
p#o-1 


= x(0-1)(p-1) + x(0-1) — 9) x) = Px(o-1), 
p 


where ae xX(e) vanishes because ker x has index 2 in G. From (14) and (21) we 
know that 


X(o-1) = (=) = (-1)@-D/2, 


so the desired equality w? = p* follows. 
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Remark. In the preceding discussion .//p* denoted, as usual for algebraic purposes, 
an arbitrary (but fixed) solution of the equation X¥* — p* = 0 in C — which of the 
two is immaterial. In this light, the occurrence of the + in (25) and (25’) is not 
surprising. From the analytic viewpoint, however, it is relevant to ask which sign is 
the right one, if one understands by ./p* the positive square root ./p in the case 
p* = p and the complex number i ,/p in the case p* = —p. For a given p, the 
factor aj = a1(¢) € {1,—1} that appears in (24) depends at most on the choice of 
¢; that it does so depend is easy to see because when one replaces ¢ by another 
primitive p-th root C* the sign behaves as follows: 


Q7) a(t) = (2) a, 


From the point of view of analysis there does exist a canonical choice for ¢, namely 
¢ =e?7'/P, and the question then is which sign is appropriate for this choice. Gauss 
agonized over this problem for years, as he himself confessed, until he finally found 
the answer in 1805: the sign is always +1, independently of p. Readers interested 
in knowing the proof can find a matrix-theoretic one by Isaac Schur in his Werke, 
vol. II, and a function-theoretic one by Carl Ludwig Siegel in Chandrasekharan’s 
Introduction to Analytic Number Theory. 


In pursuit of explicit applications of Galois theory, we have wandered unawares 
into the realm of number theory. We might as well go one step further and prove 
the famous quadratic reciprocity law of Gauss. Without having to fuss about the 
exact sign determination of the Gaussian sums (26), we already have in hand a key 
to the problem, in the form of the relation a? = p*. 

Let g be any prime distinct from the original prime p. It will prove convenient 
to work in the ring R = Z[€] (still with ¢ = ¢,). For an arbitrary element of R, say 


a= Y > aoo(S) with dg € Z, 


(on 


we have ‘i 

ag{a) = Yarote)"' = (Yaeots)) moda. 
and hence : ° 
(28) Oq(a@) =a? mod gR. 


Applying this to a = ,/p* given by (25), one gets, using (15), 


(29) (4) vr = (/p*)* mod gk. 


After multiplying by ./p*, and assuming g 4 2, we get 


(F)e" = (p*) Et D2 mod gR. 
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Since p* and q are relatively prime, p* is invertible modulo qg in Z C R, so 


(30) (5) = (p*)9-)/? mod gR. 


But gZ[¢]N Z = qZ (as can be seen easily by comparing coefficients relative to the 
basis 1,¢,...,¢277); thus (30) says simply that 


(4) = (p*)9-)/? mod qZ. 
P 
Using Euler’s criterion (20) we then obtain 
* 
(4) = (7) mod gZ. 
P q 
Since gq # 2 this finally yields 
Chis, 
P q/ 
since the only values that can occur are | and —1. To summarize: 


Theorem 4 (Quadratic reciprocity law). If p and q are distinct odd primes, 


a ()=(2) 


which, taking into account (6'), (16) and (21), also means that 


oy 3(p-1) 5-1) (P 
(32) (4) = p2e-vian (2). 
P CD q 
In other words: 
(a) If p or q is congruent to 1 mod 4, then (4) = (4), that is, q is a quadratic 


residue mod p if and only if p is a quadratic residue mod q. 
(b) If both p and q are congruent to 3 mod 4, then (Z) = -(4), that is, q is a 
quadratic residue mod p if and only if p is not a quadratic residue mod q. 


Equation (32) is completed by two “supplementary laws” for quadratic residues: 
For any odd prime p, 


= | 
(33) (=) = (1) 2-D/2 
and 
2) @ (y\?=)/8 1 for p=1or —1 mod 8, 
ve Gir = ~ |=1 for p =3o0r —3 mod 8. 


Proof. All that remains to prove is (34). To do this we work in the field of eighth 
roots of unity over Q. Choose 


(35) ae ile 
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so 
(36) V2=640", 
and with this one gets a formula analogous to (25). Now, if €, is the sign such that 
(37) 6 (V2) = epv2, 
then the relation parallel to (29) is 
Epv2 = /2” mod DP; 


and one uses Euler’s criterion exactly as above to obtain 


(38) ep = (=). 


But since o,(¢) = ¢? we have 
Ep =1 => 0p(V2) = V2 > C407 H=E4 01 => p=lor—1 mod 8, 


since for the remaining p’s we have €? + €-? =—(€+¢—!). Equation (34) follows. 


Example. Is 221 a quadratic residue modulo the prime 383? To apply Theorem 4, 
we must first factor 221 into primes. Then 


(5a) = (sus) (aes) = G5 )(F7) = (3) (G9) = Ga) 
-(B)B)-8)--@)--Q->: 
so the congruence X? = 221 mod 383 has no solution in Z. 


Remarks. (a) It is clear that the algorithm illustrated by the preceding example 
will always succeed in determining whether a fully factorized number is a quadratic 
residue. It is possible to avoid the heavy computational burden of a preliminary 
prime factorization (apart from factoring out 2’s). To do this, one extends the 
Legendre symbol into the Jacobi symbol, by setting 


(39) (F):= I Cr 


whenever a, b are relatively prime integers such that a £ 0 and b is odd and positive. 
Using Theorem 4 one can easily prove (exercise) the relations 


a G)=cnternse-n(g) 
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where a is also assumed odd and positive, and 
—-1 _ 2 2_ 
ao Cy al Se a 


Since clearly (39) depends only on a modulo b, the Jacobi symbol achieves what is 
desired. For example, although 1363 = 29-47 is not a prime, we can write, letting 
b be the Fermat prime 65537: 


(e335) = (tax) = (Hs) = (G) = 


Thus 1363 is a quadratic residue mod 65537. 


(b) The deeper meaning of the quadratic reciprocity law, which far transcends its 
nice algorithmic handiness, is beyond our scope. Nonetheless, there is an important 
fact that we can touch upon, and which can be read off from the properties (40) and 
(41) of the Jacobi symbol, namely: The value of (¢) depends on b €N only modulo 
4a (and if a = | mod 4, then ($) depends on b only modulo a). For a given integer 
a, then, we have: If p and p’ are odd primes not dividing a, then 


(<) = (4). if p = p’ mod 4a. 


Thus the quadratic residue behavior of a modulo p is the same for all primes p 
that belong to the same residue class mod 4a. This implies (though we cannot go 
into it here — see F. Lorenz, Algebraische Zahlentheorie, BI-Verlag, 1993) that the 
decomposition behavior of primes p in the quadratic number field Q (Ja ) depends 
only on p modulo 4a. (In the case a = —1, compare with §4.18 in the Appendix.) 


(c) A word on the history of the quadratic reciprocity law. In 1875 Kronecker 
(see vol. II of his Werke) called attention to the fact that the law was first stated by 
Euler — as far back as 1744 in its essentials, and in much more developed form again 
in 1783. Thus Euler preceded Legendre (1785) and Gauss (1801), who seem to have 
overlooked his remarks. However, the indisputable merit of finding a real proof for 
what was until then a heuristic observation belongs to Gauss; see his Disquisitiones 
Arithmeticae, where he in fact makes some historical remarks as well (articles 151, 
296, 297 and Addenda at the end). 


(d) The quadratic reciprocity law is a mathematical gem in itself. It has also 
proved to be a landmark along a road which, in the realm of algebraic number 
theory, has led toward fuller awareness of class regularity for abelian number 
field extensions, an awareness that has found deep expression in Artin’s reciprocity 
law. (Again, see my Algebraische Zahlentheorie.) Furthermore today a trail is 
being blazed past the abelian case (see the survey article by J. Neukirch in Ein 
Jahrhundert Mathematik, 1890-1990: Festschrift zum Jubilaum der DMV, Deutsche 
Mathematiker-Vereinigung and Vieweg, 1990). 
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Further Steps into Galois Theory 


1. What happens to the Galois group of a polynomial when the ground field is 
extended? Let f € K[X] be separable and let G be the Galois group of f over K. 
If K’ is any extension of K, it is easy to see that the Galois group of f over K’ 
can be regarded as a subgroup of G. More generally: 


Theorem 1 (Translation theorem). Let E/K be a Galois extension and K'/K any 
field extension. Assume, without loss of generality, that E and K’ are subfields of a 
field C, and let EK’ = K'(E) be their composite in C. 

(a) EK’'/K’ is a Galois extension. 

(b) The map G(EK'/K') — G(E/K) defined by restriction gives rise to an iso- 

morphism 
G(EK’/K') ~ G(E/EN K’). 

The Galois group G(E K'/K’) of the extension E K'/K' can thus be identified with 
a subgroup H of G(E/K), namely the subgroup corresponding to the intermediate 
field EQ K’' of E/K. 
Proof. Clearly E K’/K’ is algebraic and separable. Because E/K is normal, E is the 
splitting field of some set M C K[X] of polynomials over K. Thus EK’ = K’(E) 
is the splitting field of M C K’[X] over K’, meaning that E.K’/K’ is normal. Thus 
EK’/K’ is Galois. Obviously, the restriction 


+: G(EK'/K’) > G(E/K) 


Ot?OE 


() 


is a homomorphism. If o € G(EK"'/K’) acts trivially on E, it acts trivially on 
EK’ = K'(E), since o acts trivially on K’ by definition. Thus r is injective. 

There remains to show that the image of r equals G(E/EN K’). Let H be this 
image. Since every o € G(EK’/K’) fixes K’ pointwise, K’/N E is contained in the 
fixed field E” of H in E. Conversely, anything in E is left fixed by all elements 
of G(EK'/K’), and therefore lies in K’. Therefore 


(2) E# = K'NE. 
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If E/K is assumed finite, this equality implies the desired conclusion that H = 
G(E/K'N E), by Theorem 5 in Chapter 8. For the case of an infinite Galois 
extension E/K, readers might try to give a justification on their own. It will soon 
become apparent how nice it would be to have some appropriate generalization of 
the Fundamental theorem of Galois theory (Chapter 8) applicable to infinite Galois 
extensions. We will get to that in Section 12.4. Given the results there, the conclu- 
sion that H = G(E/K’ 1 E) easily follows (see Remarks on page 130). 


The state of affairs described by Theorem 1 can be conveniently visualized 
through a diagram: 


. a 7 
(|! J 


K 


F1. In the same situation as Theorem 1, assume further that E/K is finite. Then 
(4) EK':K’ divides E: K. 


Proof. By Theorem 1, the degree EK’: K’ coincides with the order of the subgroup 
G(E/E 2 K’) of G(E/K), and so divides E: K. 


Remark. Relation (4) need not hold when £/K is not Galois. For instance, consider 
K=Q, E=Q(¥2), K’=Q(&¥2), 

where C3 denotes a primitive third root of unity. It is easy to see that EK’ = 

Q(4/2, 6), E:K = K':K =3, and EK’: E =2, so EK’: K’ =2. 


F2. Let E,/K and E2/K be Galois extensions. Assume, without loss of generality, 
that E, and E4 are subfields of an algebraically closed field C, and let E, Ez be 
their composite in C. Then: 


(a) EF, E2/K is a Galois extension. 
(b) The group homomorphism 
h: GE, E2/K) > G(E1/K) x G(E2/K) 
Ot (C£,,0E) 
is injective. If Ey 0 Ex = K, the map h is an isomorphism. 
Proof. (a) Clearly E; E2/K is separable; see Chapter 7, F7. 


Let 0 : Ej Ex > C be a K-homomorphism. For i = 1,2, we have o( Ej) C Ei, 
since E;/K is normal; there follows o(£, £2) C E, E2, so E, E2/K is normal. 
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(b) If o € G(E; E2/K) acts as the identity on both £; and Eo, it does the same 
on E£, EF . Therefore h is injective. Now assume EF, M Ey = K. By Theorem 1, 
there exists for any 0; € G(E;/K) some p € G(E, E2/Ez2) such that o; = pgz,. 
Likewise, for any 02 € G(£2/K) there exists some t € G(E; E2/E1) such that 
02 =TE,. Seto = pt. Then og, = Pe, TE, = PE, = %1, 80 OF, = 04; similarly, 
OE, = 02. This shows that / is surjective. 


Remark. Let things be as in F2, and set L := E, 1 E>. Clearly L/K is normal; 
let A = G(L/K) be its Galois group. If pj : G(E;/K) — G(L/K) is the canonical 
map, for i = 1, 2, define 


G(E\/K) xa G(E2/K) := {(01, 02) € G(E1/K) x G(E2/K) | pio = p02}. 


One easily persuades oneself that the image of the homomorphism h defined above is 
precisely the subgroup G(E1/K) xa G(E2/K) of G(E1/K) x G(E2/K), and thus 


yields an isomorphism 
(5) G(E| E2/K) ~ G(E1/K) xa G(E2/K). 


Indeed: For h(a) = (o£,,0z,) we have (oz,)_ = Oo, = (C£;)L, So h(o) lies in 
G(E,/K) xa G(E2/K). Conversely, if (0,02) is an element of this product, we 
find ne ano € G(E, E2/K) such that o7, = (01), = (02). Then, fori = 1,2, we 
have On o; € G(E;/L); but by F2, part (b), G(E1 E2/L) > Can) x G(E2/L) 
is surjective, so there exists t € G(E, £2/L) such that tz, = = 07) o;. Now the 
element o := oT Satisfies Gg, = 0; for i = 1,2. 


2. We now discuss a result of R. Dedekind that is of fundamental importance to 
field theory. 


Theorem 2 (Linear independence of field homomorphisms). Let E/K be a finite 
separable field extension of degree n, and let C be an algebraically closed extension 
of K. Denote by 0,,02,...,0, the distinct K-homomorphisms of E in C. Then 


01,02,...,On are linearly independent over C; that is, for any c1,C2,...,Cn inC, 
the condition 

(6) Y > cjo;(B)=0 forall Be E 

implies c, = C2 = ++: = Cy = 0. 


The following demonstration is somewhat redundant because this theorem is 
contained in a result shortly to be stated (Theorem 2’), for which we give a different 
and simple proof. But the method of this first proof is of great intrinsic interest. 


Proof of Theorem 2. Let B1,..., Bn be a basis of E/K. Clearly, (6) is equivalent to 


(7) >" ci0;(8j))=0 forl <j <n. 


i=1 
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The assertion of Theorem 2 is thus equivalent to the nonvanishing of the determinant 
of the n xn matrix (0;(f;));,;. Set 


(8) A(Bi...., Bn) = det (0;(B;))i,;- 

So what we have to show is that for some (and therefore for every) basis B1,..., Bn 
of E/K we have 

(9) A(B1,..-,Bn) #9. 


Since E'/K was assumed separable, the primitive element theorem (Chapter 8) yields 
an a € E such that E = K(q@), and we can consider the particular basis of E/K 
given by 


(10) Peis Ho PS xia. 


Then o;(6;) = o;(a/~!) = oj(a)/~!. Thus A(1,a,...,a@"~!) = det (a; (a@)/~!);,; 
is a Vandermonde determinant (see LA I, p. 155), and as such it evaluates to 


(11) A(1,a,...,0"7!) =] | (a;(@) —o(@)). 
i<j 
But since 01,..., 0 are all distinct and w generates E over K, the 0(@),..., On (a) 


are all distinct. Thus the particular basis (10) does satisfy (9). 
As mentioned, Theorem 2 also follows from the next result: 


Theorem 2’ (Artin). Let M be a monoid and F a field. If 01,...,0n are pairwise 
distinct homomorphisms of M into the multiplicative group F™ of F, theno,,...,On 
are linearly independent over F. 


Proof. The proof rests on a simple fact from linear algebra: 

Let B be a set of endomorphisms of an F-vector space V. Let v1,...,Un € V be 
simultaneous eigenvectors of all the B € B; that is, suppose that for each B € B and 
each | <i <n there is a unique 4; (B) € F such that 


B(v;) = Ai (B) V3. 


If the functions 41, ..., 4: B— F are all distinct, the vectors v1, ..., Un are linearly 
independent. 

To see this, consider a nontrivial linear dependence involving as few v;’s as 
possible; write it (after renumbering if necessary) as v; = a cju;, with k > 1 
and c; £ 0 for each i. Then 0 = Bvy—A1 (8) v1) = par Cj (Ai(B)—A1 (B)) v;, leading 
to a shorter linear dependence if we take 6 such that 4(8) #A,(6). Contradiction. 

Now, to see why Theorem 2’ follows from the linear algebra statement, consider 
the F-vector space V = F™ of all maps from M into F. For every B € M we 
take an endomorphism of V—also denoted by 6 —as follows: For any 0: M > F 
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in V, the image B(o): M —> F is the map y  o(fy). Then, for each o; in the 
statement of the theorem, we have B(0;)(yv) = 0; (By) = 0;(B)0;(y), so 


B(oi) = 0; (B) oj. 


Therefore each o; is an eigenvector of every 6, with eigenvalue o;(6) —here we 
used the fact that 0; 4 0. Since the maps o1,..., 0, were assumed to be all distinct, 
the linear algebra statement applies, with o; playing the role of both A; and v;. This 
proves that the o; are linearly independent. 


Remark. Let E/K be a Galois extension of degree n, with Galois group G. By 
extension of the field of constants, we can make the tensor product E @x E into a 
module over the group algebra EG in a natural way: 


(12) co(x@y)=aox @cy. 


(For the definition of the group algebra see Section 6.2.) Now let 01,...,0, be 
the m distinct elements of G. Consider the homomorphism of EG-modules f : 
E®@xKE — EG such that 


n 
f(x®@y) =) GO) yar. 
i=1 
We claim that f is an isomorphism, so we get a canonical E G-module isomorphism 
(13) E@xE~ EG. 


To prove this it is enough to show that f is injective, since the E-vector spaces 
E @xE and EG both have dimension n. Let 61,..., By be a basis of E/K. Every 
element z of E @x E can be expressed (uniquely) in the form 


n 
o> 7B} @c; withe; e E. 


j=l 
Now, if f(z) = 0, we have 
0= (Lor Bea) =o | or Bic) on 
j=l Si=1 i=1 \j=1 


There follows 
n 
So o7'(Bj)¢7 =0 for alli. 
j=l 
But in the proof of Theorem 2 we saw that the matrix (o;'(B a) j cannot be 
markrightThe existence of a normal basissingular; consequently all the cj vanish, 
proving the claim. 
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It is worth remarking that the isomorphism statement (13) can be viewed as a 
deep reason why Galois theory works. In this connection, E. Artin has shown that 
Theorem 2’ can serve as the starting point for a logical treatment of Galois theory; 
see E. Artin, Galois theory, Notre Dame mathematical lectures, 1942. See also the 
hint to §12.6 in the Appendix. 


3. We now come to a beautiful and momentous theorem of Galois theory, which 
actually amounts to a strengthening of the isomorphism statement (13), as will be 
explained more precisely later. 


Theorem 3 (Existence of normal bases). Let E/K be a finite Galois extension with 
Galois group G. There exists an element a in E such that the family 

(14) (O(a) )oeG 

is a basis of E/K. Such a family is called a normal basis of E/K. 

Remarks. (i) Any t € G permutes the elements of a normal basis and is uniquely 
determined by this permutation. 


(ii) If (o(@))¢ is a normal basis of E/K, a minute’s thought shows that E = K(a); 
in other words, q@ is a primitive element of E/K. 


(iii) Let p be prime and let ¢, denote a primitive p-th root of unity (in C). Then 
a := Cp gives rise to a normal basis of Q(¢,)/Q. Indeed, as we know, the Q- 
conjugates of ¢, are 


(15) boemeae. 


but since 1, fp, ee ho s form a basis of Q(¢,)/Q, so do the elements in (15). 
The primeness of p is essential; the corresponding statement for, say, the field Q(i) 
of fourth roots of unity would be false, since 7,—i are linearly dependent over Q. 


Proof of Theorem 3. We first take an arbitrary element a of E and assume there is 
a relation 
y aco(a)=0, withag Ee K. 


ocG 
For any t € G we can apply t~! to the sum, obtaining 
Ss dg t lo(a@) = 0. 
ocG 


From this we see that to force all the ag to vanish, it suffices to ensure that 


(16) det(t~'o(@)), seg #9, 


so our task is to prove that there exists ~ € FE with this property. By the primitive 
element theorem there exists B € E such that E = K(8). Then 


f(X) = [ [ (X -of) 


ocG 
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is the minimal polynomial of 6 over K; see Chapter 8, Fl. For each o € G, consider 
the polynomial 


fie OD) | 
g(xX)= Yop € E[X]; 
then 
(17) g°(B)=0 for o #1, but 


g°(8) 40 for o=1. 


Now let d(X) be the determinant of the matrix 


t lo L(X) 
Neg = (pS) € MELD. 
(8° "eo = (Ga aB) eg & MnELD 
When we plug X = f into this polynomial matrix we get a diagonal matrix whose 
diagonal entries are equal and nonzero — all of this by (17). Taking the determinant 
we get d(B) #0, so d(X) cannot be the zero polynomial: 


(18) d(X) £0. 


Now assume that the field K has infinitely many elements; then, by (18), there exists 
y in K such that d(y) 4 0. So, for such a y, the matrix 


( fy) ) = (=o fy) )) 
—1 

yY-Tt oB 4:0. y= 1,0 

has nonzero determinant; thus the element 


iced WO 
roe 
satisfies the desired condition (16). The existence of a normal basis is proved in the 
case where the ground field K is infinite.! 
In the case of finite fields we must resort to a different argument. In this case 
Galois groups are necessarily cyclic (see Theorem 4 in Section 9.5). The finiteness 
of K does not come in other than via this fact; for this reason we may as well 


assume simply that 
G = G(E/K) is cyclic. 


So let o be a generating element of G and let n = E: K be the order of o. We 
regard o as an endomorphism of the K-vector space E,, and show that the minimal 
polynomial of this endomorphism is X¥” — 1. Since o” = 1, clearly o is a root 
of X¥”—1. On the other hand, o cannot be a root of a nonzero polynomial of 
smaller degree (over K), because this would amount to a linear dependence relation 
among the automorphisms 1, o, o?,...,077}, contradicting Theorem 2’. Now, the 

! It should be clear how the proof needs to be modified in order to obtain an @ that gives rise 

to anormal basis of E/F for every intermediate field F simultaneously. 
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dimension of the K-vector space E is n; therefore the minimal polynomial and the 
characteristic polynomial of o coincide. But then there exists a cyclic vector for o 
(see LA II, p. 168), which is to say some 6 € E such that 


(19) B,oB,o7B,...,0" 'B 


form a basis of E/K. By definition, the elements (19) form a normal basis of £/K. 


The existence theorem for normal bases can also be expressed as follows: 


Theorem 3’. Let E'/K be a finite Galois field extension with Galois group G. The 
group algebra KG and E are isomorphic as KG-modules: 


(20) E~ KG as KG-modules. 


Proof. Every element x of KG has a unique representation 
x= Vidgs. 
ocG 


E has a canonical KG-module structure given by the map 


(Xaea.e) > Yao). 


Like any ring, KG is a module over itself via the map (x, y) + xy. Now, for any 
a € E, the map 


(21) Yo doo = Yi dgoa 
oO oO 


is obviously a KG-module homomorphism from KG into E. Conversely, given a 
KG-module homomorphism g: KG — E, set w:= y(1); then g must have the form 
(21). But clearly (21) is an isomorphism if and only if the images ow of theo €G 
make up a basis of E/K. Thus we have shown the equivalence between Theorems 
3 and 3’. 


Remarks. (1) Let E/K be an extension of finite fields. Since we know certain things 
about finite fields (for instance, that the multiplicative group E™” of E is cyclic), it 
does not seem totally unreasonable to ask whether one can exhibit more concretely 
a normal basis for E/K. It is true that the author has never come across a solution, 
but the problem is hereby posed anyway (see also §12.3 in the Appendix). 


(2) The existence theorem for normal bases strengthens the earlier statement that 


(22) E@xE~EG 
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as E'G-modules (see (13) in the remark following Theorem 2’). This is clear, 
because the existence of a normal basis for E/K implies, as we have seen, the 
isomorphism of KG-modules 


(23) E~ KG; 


by tensoring one then gets E @x E ~ KG @®xKE ~ EG, which is (22). Note, 
however, that in contrast with (22), the isomorphism (23) is not canonical. 


(3) The existence proof we gave for normal bases is less than fully satisfying, in 
that it requires separate treatment for finite and infinite ground fields. It would be 
much nicer anyway to be able to derive the existence of the isomorphism (23) from 
the canonical isomorphism (22). This turns out to be possible, as proved by M. 
Deuring. More precisely: 


Let M and M' be modules over a K-algebra A, both finite-dimensional over K. 
If for a finite field extension E/K there is an isomorphism M @x E =~ M' ®x E of 
A ®x E-modules, then there is an isomorphism M ~ M' of A-modules. 


Sketch of proof. Every A-module M with M: K <oo is of course the direct sum of 
directly indecomposable submodules. By a fundamental theorem of Krull, Remak 
and Schmidt (which we will prove in Volume II, Chapter 28), such a decomposition 
is unique up to isomorphism and reordering of the indecomposable summands. Now, 
if we assume that M @x E and M’ @x E are isomorphic as A ®@ x E-modules, they 
are also isomorphic as A-modules. But as A-modules they also clearly satisfy 
(setting n = E: K) 


M®xE~M", M'@®xeE~M", 


and so also 
M"~M"". 


Applying the Krull-Remak—Schmidt Theorem we get M ~ M’. 


Definition. Consider a finite Galois extension E/K with Galois group G. Let H 
be a subgroup of G and let F be the corresponding intermediate field of E/K. For 
x € E, set 


(24) Try(x) = )> ox. 


ocH 


Clearly Trx#(x) is invariant under all the t € H, and so lies in the fixed field F 
of H. Thus we get a function Try : E — F, which we denote also by Trg;r and 
which we call the trace with respect to H (or to E/F). 


F3. Let E/K be a finite Galois extension with Galois group G. For a given subgroup 
HT of G with fixed field F, let 0, H,...,0mH be the distinct left cosets of G with 
respect to H (som=G: H and an element of G lies in o; H if and only if it coincides 
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with o; on F). If for some a € E the conjugates oa, foro € G, form a normal basis 
of E/K, then the elements 


(25) oo, Trejo, F(@), 05 Tre/o,F(@), ---, Om TTE/omF(@) 


form a basis of F/K. If H is a normal subgroup of G (equivalently, if F/K is 
normal), the basis given by (25) is a normal basis of F/K. In any case we have 
F= K(Trg/F(@)). 


Proof. A given x € E has a unique representation 
x= Sy dgoad withd, € K, 
ocG 
and it lies in F if and only if it is invariant under every t € H. But 


x= y Agloa = y Az-1p Pa = ) d,-1,00, 


ocG pEG oc€G 


so tx = x for every t € H if and only if a,-1, =a, for every t € H and every 
o €G, that is, if and only if @ is constant on every right coset Ho. The elements 
1 


Oy ress i each represent a different right coset of G modulo H, so x lies in F 


if and only if x is of the form 


m 


x= Ya s rota), with a; € K. 


i=1 teH 


Notice here that to; = 0; : (o;t0;') and that the subgroup o; 1 ee is associated 


with the fixed field 0; F, so we finally get, as a necessary and sufficient condition 
for x to lie in F, that x be of the form 


m 
x= y\ aio; " TrE/o;F(@), witha; € K. 


i=1 


Such a representation is unique by construction, so we have in fact shown that the 
elements listed in (25) form a basis of F/K —note that they are indeed in F, since 
0; | Trejo, F(a) € 0; (0; F) = F. 

If H is a normal subgroup we have o; F = F, so (25) lists precisely the conju- 
gates of Trgz/F(a). 

As for the last assertion of F3, one easily sees that in any case oj Tre;r(a) # 
oj; Trg/F(@) fori # j; thus Trz/r(@) has at least m = F': K distinct conjugates in 
E, and the statement follows. 


4. Let K be a field and C a fixed algebraic closure of K. Denote by C, the 
separable closure of K in C. Then C;/K is a Galois extension, and any Galois 
extension £/K with ground field K can be regarded as intermediate to C;/K. 


Infinite Galois extensions 125 


Thus the Galois extension C;/K might prove to be an especially worthy object of 
study. However, C;/K is generally not a finite extension. The question, then, is 
to find an appropriate generalization of the fundamental theorem of Galois theory 
from finite to arbitrary Galois extensions. That the theorem as given in Chapter 8, 
Theorem 5 does not apply to infinite Galois extensions is shown by the following 
example. What happens is that the one-to-one correspondence between intermediate 
fields and subgroups of the Galois group breaks down — we know from Chapter 8, 
Theorem 2 that there is an injective map from the former to the latter always, but in 
the infinite case different subgroups of the Galois group may have the same fixed 
field. 


Example. Let K = F, be the prime field of characteristic p > 0 and set Fy := C = 
C;. Denote by g € G(C/F,) the corresponding Frobenius automorphism, defined 
by gx = x?. The fixed field of the subgroup H = (g) of G(C/F,) generated by ¢ 
consists of all elements x € C such that x? — x = 0; therefore 


(26) fixed field of (gy) = F. 


Nevertheless, we will show that the group corresponding to the intermediate field 
Fy does not coincide with H: 


(27) G(Epoo/Fp) # (9). 


To do this we take any prime g and consider the subfield 
(28) F = |J Eom 
m 


of F,o0 consisting of all x € Fyo such that xe" = x for some m € N. For every 
x € F, the degree F,(x):F, is a q-power. Therefore F # F,-0, and consequently 
there exists 

t €G(Fp~0/F) such that t ¥ 1. 


Now, if (27) were not satisfied, t would be a power of yg, say t = y”, where we may 
as well assume n € N (otherwise replace t by t~!). Then F would be contained 
in the fixed field F,» of g” = t. But this is impossible, since Fp» /F, has degree n, 
whereas the field F in (28) has infinite degree over F,. 


In the sequel, assume given an arbitrary Galois extension E/K, with Galois 
group G = G(E/K). If F is an intermediate field of E/K that is Galois over K, 
and if o € G, we denote by 


oF 


the automorphism of F' arising from o. (We switch away from the more natural 
notation of used up to now for reasons of convenience, which will soon become 
obvious.) 
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Now, we know that any element of EF lies in a subfield L of E such that L/K 
is finite and Galois. Thus, if we let ZL run over all intermediate fields of E/K with 
this property, the natural homomorphism 


h:G > [| G(L/K) 
(29) fs 
oH (o")p 


is injective. In the sequel we will continue to use L as a running index to designate 
all intermediate fields L of E/K such that L/K is finite and Galois. 
How can the image of h in (29) be characterized? For L C L’ there is a canonical 
map 
Sry: G(L'/K) > G(L/K) 


ther 


and obviously if L C L’ C L” we have 
fiji = fiji © fiji": 
Moreover for o € G the components of /(c) satisfy 
fru(oX)=oF if LCL’. 
This leads us to the following notion: 


Definition. Let J be a (partially) ordered set of indexes; assume further that J 
is directed, which means that for any i,i’ € J there exists 7 € J with i < j and 
i’ < j. Assume given a family (G;);e7 of sets (groups, rings, topological spaces, 
etc.) together with maps (homomorphisms) 


Sij : Gj > G; 


for each pair (i, 7) of indices in J such that i < j. This setup is called a projective 
system if in addition we have 


Sik = fiz° fix whenever i <j <k. 


The projective limit of such a projective system is defined as the following subset 
of the cartesian product of the G;: 


(30) lim G; := {(;); € [] G | fiy(oj) = 0% for i <j}. 
iel iel 


When the G; are groups or rings, the projective limit is obviously a subgroup or 
subring of [] G;; if the G; are topological spaces, the projective limit is a subspace 
of the topological space [[ G;, and because all the fj; are assumed continuous, it 
is in fact closed in || G; if all the G; are Hausdorff. 
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Now let’s get back to the situation that we had set up starting from an arbitrary 
Galois extension E/K. Thanks to the notion just introduced, we can state: 


F4. If E/K is an arbitrary Galois extension, the map (29) yields an isomorphism 


(31) G(E/K) — lim G(L/K), 


where L runs over the set of all intermediate fields of E/K that are finite and Galois 
over K. 


Proof. Everything is clear except the surjectivity of (31). So let 


(32) (ox)z € lim G(L/K) 


L 


be given; we must show that there exists 0 € G(E/K) such that 
(33) o- =o; for every L. 


Since, as mentioned, £ is the union of the L, an element o € G(E/K) is fully 
determined by the conditions (33). Conversely, the existence of such a o will be 
obvious if we can prove that, for any L;, L2 in our index set, the maps oz, and 
oz, coincide on Lo := L, NM Ly; in other words, that 


Lo _ Lo 
Or, =): 


Let L := L,L, be the composite of L; and L, in LE. Because of (32) we have 


L 3 
Op = Seyi (OL) = On) 07,7? = fist OL) = OL: 


This indeed implies that 


L L L L L 
OF) = (071)? = 07° = 0,7)" = 07). 
Definition. We talk of a topological group G when G has, besides a group structure, 
also a topology such that the map (x, y) + xy—! of G x G in G is continuous. 


Remark. If (G;)ie7 is a family of topological groups G;, the cartesian product 


[Gi 


ie. 


is of course also a topological group. If (G;)ie7 is a projective system, the projective 
limit 

lim Gi 

iel 
is a topological group, since it is a subgroup of [] G;. As remarked earlier, the 
projective limit is in fact closed in G; if the G; are Hausdorff. 
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Definition. Let E/K be any Galois extension. We endow the Galois group G(E/K) 
of £/K with a natural topological group structure as follows: Give each finite group 
G(L/K) in (31) the discrete topology and then simply transfer to G(E/K) the 
topological group structure of 


(34) G = lim G(L/K), 
L 


via the isomorphism (31). The resulting topology on G(£/K) is called the Krull 
topology. 


F5. Let E/K be a Galois field extension. The Galois group G = G(E/K) of E/K 
becomes a compact topological group with the Krull topology. The family 


(35) (G(E/L))z, 


where L runs over all intermediate fields of E/K that are finite and Galois over 
K, is a fundamental system of open neighborhoods of | in G; that means, first, that 
each G(E/L) is open, and second, that any neighborhood of | in G contains some 
G(E/L). 


Proof. By the well-known theorem of Tichonov (or Tychonoff), the cartesian prod- 
uct of compact topological spaces is compact. The spaces G(L/K) are trivially 
compact, being finite and discrete; thus the projective limit (34), being a closed 
subset of the cartesian product of the G(L/K), is also compact. 

Let J be the set indexing the intermediate fields L. If S runs over the finite 
subsets of J, the sets 


Us:= [1 {1} x TP] G(L/K) 
Les L¢S 


form, by the definition of the product topology, a fundamental system of open neigh- 
borhoods of | in the cartesian product of the G(L/K). Thus their intersections with 


G = limG(L/K) 


also form a fundamental system of open neighborhoods of 1 in the topological group 
G. For given S, let L be the composite of all the L’ € S. It follows easily from the 
definition of the projective limit that 


Us NG =U NG. 


But the inverse image of Us7) 9 G under the map in (31) is no other than the 
subgroup G(E/L) of G = G(E/K). This proves F5. 


We can now extend to infinite Galois extensions the fundamental theorem of 
Galois theory, stated in Chapter 8, Theorem 5 for the finite case: 
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Theorem 4. Let E/K be any Galois extension. The map 
(36) Fw G(E/F) 


is a bijection between the set of intermediate fields of E/K and the set of closed 
subgroups of G = G(E/K). This bijection maps intermediate fields of finite degree 
over K to open subgroups of G, and vice versa. 


Proof. (i) Any open subgroup H of a topological group G is closed in G, because 
G \ H is aunion of cosets gH # H, and every gH is open, being the image of an 
open set H under the homeomorphism x + gx of G. 


(ii) Let F/K be a finite intermediate extension of E'/K and denote by L the normal 
closure of F/K in E. Then L/K is also finite. By F5, G(E/L) is open in G. Since 
G(E/L) is contained in G(E/F), the latter group is also open (being the union of 
cosets gG(E/L), all of which are open). 


(iii) As we know, for every intermediate field / of E/K the extension E/F is 
Galois, so F is the fixed field of G(E£/F) in E: 


(37) FeEee, 


see Chapter 8, Definition 1 and Theorem 2. From (37) we immediately deduce that 
the map (36) is injective. 


(iv) We claim that G(E/F) is always closed in G. For let o be an element of G and 
take o € G(E/F). There exists an intermediate field Fo of F/K for which Fo/K 
is finite and on which o is nontrivial — meaning that o € G(E/Fo). Therefore 


oG(E/Fo)NG(E/F) = &. 


This justifies the claim, since the finiteness of Fo/K implies that oG(E/Fo) is an 
open neighborhood of o in G, by (ii). 


(v) Let H be an open subgroup of G. By F5, there exists a Galois subextension 
L/K of E/K such that 
G(E/L) CH. 


Let F = E” be the fixed field of H. From (37) we then get 
BSE Cpe Gh fF, 
so F'/K is finite because L/K is. 


(vi) To complete the proof of the theorem all we need to do is show (and this is the 
nub) that every closed subgroup H of G satisfies 


(38) H =G(E/E®). 
First let H be any subgroup of G. Trivially, H C G(E/E”). We show that 


(39) GEE) =i, 
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where H™ is the closure of H in G. Let F = E” be the fixed field of H in E, and 
let o be any element of G(E/F). To prove that o € H7 it is sufficient to check that 
H intersects every fundamental neighborhood oG(E£/L) of o in G = G(E/K): 


(40) oG(E/L)NH#®@. 


To see this, let Ho be the image of H under the canonical map G(E/F) > 
G(LF/F). Then F is also the fixed field of Ho in LF. But the Galois extension 
LF/F is finite, so the fundamental theorem for finite Galois extensions (Chapter 8, 
Theorem 5) yields 

Hy = G(LF/F). 


Thus for the given o € G(E/F) there is t € H such that o4 = t¥. Setting p = 
o'r € G(E/L) we conclude that op = T lies in the intersection (40). 


Remarks. Let F/K and F’/K’ be Galois extensions with K C K’ and F C F’. 
The natural homomorphism 


(41) +: G(F'/K’) > G(F/K) 


is continuous, because if G(F'/L) is a basic open neighborhood of | in G(F/K), 
then G(F’/LK’") is an open neighborhood of 1 in G(F’/K’), and G(F"/LK"') © 
r!(G(F/L)). 

The fixed field of r(G(F’/K’)) in F is obviously FM K’. Since r(G(F’/K’)) 
is compact (being a continuous image of the compact G(F’/K’)) and hence also 
closed in G(F/K), it follows from Galois theory (Theorem 4) that we have an 
equality 


(42) r(G(F'/K’)) = G(F/FO K’). 
Thus the map r in (41) gives rise to a surjective homomorphism 
(43) G(F'/K') > G(F/FOK’) 


of topological groups; its kernel is G(F’/F'K’). We claim that the map (43) is open. 
To justify this we must show that the earlier map r in (41) is open if FM K’ = K. 
So take G(F’/L’), a basic open neighborhood of 1 in G(F’/K’); we must show 
that the image r(G(F’/L’)) = G(F/FNL’) is open in G(F/K). By assumption, 
G(F’/L’) is a normal subgroup of finite index in G(F"/K"); this carries over to their 
homomorphic images, so that G(F/F ML’) is a normal subgroup of finite index 
in G(F/K). Thus FO L’/K is Galois with finite Galois group G(FNL'/K) ~ 
G(F/K)/G(F/FOL’), and therefore r(G(F'/L')) = G(F/FNL’) really is an 
open neighborhood of 1 in G(F/K). 

In particular, if F is an intermediate field of a Galois extension E/K and F/K 
is also Galois, the restriction homomorphism G(E/K) > G(F/K) is continuous 
and open, and so gives rise to a canonical isomorphism 


G(E/K) /G(E/F) ~ G(F/K) 
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of topological groups, where the quotient group G(E/K)/G(E/F) is given the 
quotient topology (that is, the finest topology for which the quotient map G(E'/K) > 
G(E/K) /G(E/F) is still continuous). 

Incidentally, one can easily check that, if F is an intermediate field of a Galois 
extension £'/K, the Krull topology of G(£/K) induces on the subgroup G(E/F) 
the Krull topology of the Galois group G(E/F). 

Another thing that can be checked easily: If H is an open subgroup of the Galois 
group G = G(E/K), then H has finite index in G, and in fact G: H = F: K, where 
F is the fixed field of H in E. 


13 


Norm and Trace 


1. We mentioned in Chapter 1 that, by regarding an extension E of a field K as 
a vector space, we gain the ability to use the powerful tools of linear algebra. It 
is true that after Chapter 3 this viewpoint receded to the background, and only in 
Chapter 12 did we start making frequent use of it again. But in this chapter we will 
examine in our context some simple but effective concepts from linear algebra. 


Notation. Departing from our practice so far, we will use the letter K in this chapter 
to denote not necessarily a field, but any commutative ring with unity. 


If A is a K-algebra and M is an A-module, we consider for each a € A the 
K-endomorphism 
aM ixhax 


of M. We also write ayy/x instead of ays. 


Definition. In the situation above, assume in addition that / is a finitely generated 
free K-module; this means M has a finite K-basis. The characteristic polynomial 
P(auyx) of yx € Endx(M) is called the characteristic polynomial of a with 
respect to the A-module M. It is an element of K[X], and we denote it by 

Pu/K (a) = Puy (a: X). 
Likewise, we call 

Tryg/K (a) := Trace aysK 
the trace of a, and 

N/K (@) = det am/K 


the norm of a, always with respect to the A-module M. 

The following properties of the norm and trace are obvious: 
(1) Tru/x (a + B) = Truyx (a) + Trw;x (6), 
(2) Try/x (aa) =aTry/x(@) forae K, 
(3) Tryx (@B) = Truyx (Ba). 
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It follows from (1) and (2) that Tryg;x : A > K is a K-linear form on A, and by 
using (3) as well we see that the map (a, 8) > Trag/x (a) is a symmetric bilinear 
form on A. 

Next we have 


(4) Nujx (@B) = Nuk (@)Nu/x (6). 


If C(aw) = (cj;(@));,; is the matrix expressing ajyg/x with respect to a K-basis 
€1,...,€n Of M, we have 


(5) Tr (a) = Trace C(a) = 97; cji(@), 
(6) Nm/x (a) = det C(a@), 
(7) Pyjx (a; X) = det(X In — Ca). 


where J, is the n xn identity matrix. If Pyg;x (a) has the form X” + Gyn XE 
+--+ do, we have 
(8) Try/K (a) =—an-1, 
(9) Nu/x(@) = (-1)"ao. 
F1. In the preceding situation let M, be a submodule of the A-module M and set 
M2:=M/M,. If M, and M2 are both finitely generated free K-modules, M is also 
one, and for each a € A we have 

Trax (@) = Trm,/x (a) + Trm,/K(@), 

NuyjK(@) = Nuy/K(@): Nup/K(@), 

PujK (a; X) = Puysx (a; X)- Puy/K (a; X). 

In particular this is true when M ~ M, ® Md. 


Proof. Let e1,...,@m be a K-basis of M,. Choosing representatives f|,..., fn for 
the elements of a basis of M/M, = M) obviously yields a basis of a submodule 
of M complementary to the K-module M,. Thus ¢1,...,@m, f1,---; fn form a 
K-basis of M. With respect to this basis the matrix of some ay has the form 

Ci\(a) x 

0 Ca)’ 

where Cj (q@) is the matrix of oj, relative to e;,...,@m and C2(a@) is the matrix of 
am, Felative to fi waren Aa: The rest follows. 


Remark. Let the situation be as in Definition 1, and let L be a commutative K- 
algebra. The L-module My, = M @x L can be seen in a natural way as a module 
over the algebra A, = A @x L. For all a € A we then have 


(10) Try, /L(@@ 1) =Try/x(@), Nu,/r(@® 1) = Nux(@), 
(11) Pyz/L(a@ @ 1; X) = Pu (a; X). 
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For if e1,...,e, form a K-basis of M, then e;@1,...,e,@1 form an L-basis of 
M, = M ®x L, with respect to which aj, has exactly the same matrix that aj 
has with respect to e;,..., en. 

In a nutshell, (10) and (11) say that the trace, the norm and the characteristic 
polynomial are invariant under extension of the ground ring. Carefully distinguish 
this situation from the next one: 

Let A and A’ be K-algebras, M an A-module and M’ an A’-module. As 
K-modules, let M and M’ be free and finitely generated of dimension n and n’, 
respectively. It is easy to check that the A @x A’-module M @x M’ satisfies 


(12) Truem’/K (a @ a) = Try (@)- Trax (@’), 

(13) Nem’ /x (a ® a’) = Nuyx (@)"- Nujx (a): 

Definition. Let A be a K-algebra and suppose that A is free and finitely generated 
as a K-module. The characteristic polynomial of an element a € A with respect 
to the A-module A is called the (regular) characteristic polynomial of a, and is 
denoted by P4;x(@) = P4/x(a;X). We define similarly the (regular) norm and 
trace of a, and denote them by 


Na/K(@), Tr4/K (a). 


Remark. Let A be as in Definition 2. Since N4/x,Tr4/x, Pa/x are special cases of 
Definition 1, they satisfy properties (1)—(13) above. In the role of F1 we have at least 
the following fact: If A= A; x A2x---x Ay is a direct product of finite-dimensional 
K-algebras A;, then (by Fl) we have, for every a = (@1,02,...,Q@,) € A, 


(14) TrajK (a) = 2) Tr4;/K (ou), Nayx (oo) = 1] Naj/x (4). 
U I 
(15) Pajx (a; X) =] Pa; (ai; X). 
L 
F2. Let B be a subalgebra of a K-algebra A. Suppose that both B as a K-module 


and A as a B-left module are free and finitely generated. Then the same is true of 
the K-module A, and 


(16) P4/K(B; X) = Pajx(B: X)”" for Be B, 


where m is the number of elements of a B-basis of A. Similarly, 


(17) Trayx(B) =mTrpyx(B) and Na/x(B) = Nayx(B)” for BEB. 


All three equalities follow directly from the isomorphism A ~ B™ of B-modules 
together with F1. But note that they are only valid for elements 6 in B. In trying to 
generalize for arbitrary elements in A one runs into obstacles; but see F5 and also 
§13.1 in the Appendix. 


F3. Let A be a K-algebra as in Definition 2. An element a € A is invertible in A if 
and only if N4/K (a) is invertible in K. 
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Proof. If wa~! = 1 then N(a)N(a!) = 1, so N(q@) is invertible. Conversely, 
if N(a) = deta4/x is invertible, «4x is invertible in Endx A; in particular there 
exists x € A such that ax = 1. But then N(a)N(x) = 1, so N(x) is invertible, and 
repeating the argument with x instead of a we obtain an y € A such that xy = 1. 
Left-multiplying by a yields y = a, so a is invertible (has a two-sided inverse x). 


2. We are especially interested in the norm and trace as applied to finite field 
extensions E'/K (that is, the case where A = E is a field). Thus, let E/K be a finite 
field extension and a an element of E. Consider first the intermediate field K(a) 
of E/K. If 


(18) f(X) = X" + ayn X" | ++ +a, X +a9 


is the minimal polynomial of a over K, comparing degrees shows that a coincides 
with the characteristic polynomial of « as an element of the K-algebra K(q): 


(19) f(X) = MiPox (@) = Px(a/K (a). 
Thus, in view of (8) and (9), we also have 


(20) TrK(a)/K(@) = —an-1, 
(21) Nxa)/K(@) = (-1)"ao. 


Using F2, then, we have for the extension E/K the equality 


(22) Prjx(a; X) = f(X)”", where m= E: K(a), 
and also 
(23) Tre/K(@) = —-Map-1, NeE/K(@) = (-1)"”"ag'. 


We now obtain for the trace Trz/;x (a) and the norm Ng/x(a) of an element a 
of a finite field extension E/K the following characterization (which reveals, in 
particular, that our notation is in agreement with the Tr already introduced in F3 of 
Chapter 12): 


F4. Let E/K be a finite field extension. Let C be an algebraic closure of K and let 
G = G(E/K, C/K) be the set of all homomorphisms of E/K in C/K. If E/K is 
separable we have, for anya € E, 


(24) Trz;x(@) = >> oa 
aEG 

and : 

(25) Nex (a) = [| oa. 
o€G 


More generally, without the separability assumption we have, for any a € E, 


(26) Trex (#) =[E:K]; )) oa 


ocG 
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and 
[E:K]; 
(27) Ne/K (a) = ( I] oa) ; 
oEG 
where [E': K]; is the inseparable degree of E/K. 
Proof. We may as well assume that E CC. 


(i) Assume first that E/K is separable. For a given a € E, set F = K(a). Let 
n= K(qa): K and denote by (1,..., Pn the n distinct K-homomorphisms of K(a) 
into C. Then p1a@,..., Ona are the distinct roots of f := MiPox (a). This, together 
with (20) and (21), shows that 


n n 
Trx(@)/K(@) =—dn-1 = D2 pia = and Nxw/K (a) = (—1)"a0 = T] pie. 
i=1 i=1 


We know from Chapter 7 that each p; € G(F/K,C/K) has exactly m:= E: F 
distinct extensions o € G(E/K, C/K). Thus we have 


n 
dV oa=m YO) pia = MSxK)/K (&) = Trex (@), 
ocG i=1 
which is (24). The norm is dealt with similarly. 
(ii) In the general case, let E's be the separable closure of K in E. By definition, 


[E:K]; =[E: Es]. 


If char K = 0 we have E = Ey, so [E: K]; = 1 and there is nothing to prove. So 
we assume char K = p > 0; then the degree [E': K]; is a p-power p*, by F17 in 
Chapter 7. For every a € E, the element a?* lies in E,, and we have 


Nex (a)? = Nejx(a?) = Ng,/K(a?)”, 


sO 
Nejx(@) = Ne,/x(@?) = []o(a?*), 


by part (i), where o runs through the set G(E;/K, C/K). But the restriction map 
G(E/K, C/K) > G(Es/K,C/K) 
is bijective, so we finally get 


Ne/x (a) = [I ofa)”, 
ocG 

which proves (27). There remains to prove (26). In the case a € E;, we have 
Tre/x(@) = [E: Es|Trz,;x (a) and the assertion is clear. So take a ¢ Es. The 
minimal polynomial f(X) of a over K then has the form f(X) = g(X”), for 
some g € K[X]. Using (20) we see from this that Trx(e)/x(a@) = 0, which also 
means that Trz/x (a) = 0. But since char K = p and [E: K]; = p® > 1, nothing 
further is needed. 


138 13 Norm and Trace 


F5 (Nesting formulas for trace and norm). Let E'/K be a finite field extension and 
L an intermediate field of E/K. Then 


(28) TreyK = TrzKx OTrey/z, 
(29) Nejx = Nix ° NE|L- 


Proof. Let C be an algebraic closure of K and RC G(C/K) a set of representatives 
for M := G(L/K, C/K), meaning that for each 4 € M there is a unique p € R 
such that pp = ww. Any o € S := G(E/K, C/K) then has a unique representation 


o=pt, withoe RandteT:=G(E/L,C/L) 


(note that G(C/K) is a group, by F3 in Chapter 6). For every a € E we then get 


[] oa= |] pra= J] p( I] ta) = I] n( I] ra). 
oes pER peR TET ueM TET 
teT 


This, together with (27) and the equality [E: K]; =[E:L];[L: K]j, leads to 


(30) Ne/x(@) = Nyx (Ne/L(@)) 


as desired. The corresponding formula for the trace is derived similarly. 


For nesting formulas in the context of linear algebra see §13.1 in the Appendix. 


3. After the generalities of the last two sections, we are now ready for some specific 
properties of the trace and norm as applied to field extensions. 


F6. For every finite separable field extension E/K the trace map Trg/K : E > K is 
surjective. 


Proof. Since Trg/x is a linear form on the K-vector space E, we just need to show 
it’s nonzero. But if we had 


Tresk(@)=)>ioa=0 forallacE 
oO 


(where the first equality comes from F4), we would be in contradiction with the 
linear independence of the elements o of G(E/K, C/K), guaranteed by Theorem 2 
of Chapter 12. 


In contrast, if E/K is inseparable, Trz/x is the zero map, by (26). 
F7. For every finite separable field extension E/K, the map 


ExE>K 
(a, B) +> Trex (@B) 


is a nondegenerate symmetric bilinear form on the K-vector space E. 
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Proof. Take a # 0. If Trz;x (af) = 0 for every B € E, then Trz/x(y) = 0 for all 
y € E, contradicting F6. 


F8. Let E/K be a finite Galois field extension and G = G(E/K) the Galois group 
of E/K. The kernel of Trg;xK : E — K is made up of all finite sums of elements of 
the form 


(31) ta—a, whereae EandteG. 


If G is cyclic and generated by o, the following statements are equivalent for a given 
Vek: 

(i) Treyk(y) = 9. 

(ii) There exists a € E such that y =oa-«a. 


Proof. By Theorem 3 in Chapter 12 there exists B € E such that (tB)reg is a 
K-basis for E. Now let 


(32) y= Sart, with a, € K, 
tTEG 


be an arbitrary element of E. An application of Tr := Trg/x yields 


(33) Try= > a, Tr(tB). 
T 
But for any a € E, 
(34) Tr(ta) = > ota = DS pa =Tra, 
oc€G pEG 


so the element in (32) has trace 
(35) Try = mp)( ar), 
tTEG 
by (33). Since (tB)reg is a basis for E/K, we have 
Trp = > TB #0. 
tTEG 


By (35) we then see that Tr y = 0 is equivalent to )° a; =0. We also have, in full 
generality, teG 


> et B - (Da)s = )> ax(tB — B) = D> (t(arB) — ac). 


Taking it all together we see that Tr y = 0 if and only if y is a sum of elements of 
the form (31). 
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Now let G be cyclic with o as a generator. Clearly the set M:={oa—a|ae E} 
is a subgroup of the additive group of E. What remains to show is: For every t = o* 
in G and every a € E, each difference ta —a@ lies in M. This follows from 


k k k 


o-a-a=ao a-o hy ge 


a—++-+oa—-a 


=a(o*"a+o%7a4+---+a)— (08 'a + of 7a +--+ +0). 


It is of great significance that there is a direct multiplicative analog for F8 in the 
cyclic case (whereas surprisingly this fails to be the case in general; but see §13.4 
in the Appendix): 


F9 (Hilbert’s Theorem 90). Let E/K be a finite Galois extension with cyclic Galois 
group G = (o). Fora giveny € E™, there is equivalence between: 


G) Nek (y)=1. 

(ii) There exists a € E* such that y = ses 

o(a) 

Proof. That (ii) implies (i) is obvious. We prove the converse. First take an arbitrary, 
but fixed, 6 € E. Denote by n the order of G. Our first candidate for the desired 


element a will be 
n—-1 


a=) > djo'(B) with A; € E, 
i=0 


where we choose Ao = | without loss of generality. Then 


n-1 n 
ou =) Vo(di)o'*"(B) =) o(i-1)0' (8). 
i=0 i=l 
Thus the equation 
(36) a = yo(a) 


will certainly be satisfied if the following conditions hold: 


(37) Ai=yo(Ai-1) for 1<i<n-1, 

(38) 1 = yo(An-1). 

The conditions (37) are equivalent to 

(39) Mi=yo(y)o*(y)...07 Ny) for 1<i<n-—1. 


This determines all the A; (given that we took 49 = 1), independently of B. In 
particular, we have An—1 = yo(y)...0”~*(y) and consequently 


O(An—1) =O(y)o7*(y)...0" (vy) =v | NgyK(y). 


Hilbert’s Theorem 90 141 


Thus, if Nz/x(y) = 1, as assumed, (38) is likewise a consequence of (39). We 
therefore see that the element 


(40) a=B+yo(B)+yo(y)o7(B) +--+ (yo(y)o" *(y))o"""(B) 


satisfies (36), for every 6 € E. We now have to arrange for a in (40) to be nonzero; 


this is possible with the right choice of B because 1,0,...,0”~! are linearly inde- 
pendent over E. In conclusion, then, we have found a € E™ such that 
a 
41 = SAS 
(41) aes 


F10. If K is a finite field and E/K is a finite extension, the norm map Neg;xK is 
surjective. 


Proof. The norm map N = N¢g;x gives rise to a homomorphism 
(42) N:E* > K* 


of multiplicative groups always; we must show that this map is surjective if K and 
E are finite. We know that £/K is Galois with cyclic Galois group G(E/K) = (o). 
Now consider, besides (42), the homomorphism 


(43) 56: EX > E* 


defined by 6(a) = a/oa. By applying the fundamental homomorphism theorem for 
groups to (42) and (43) we get, in particular, the cardinality relations 


jim N|-|ker V| = |E*| = |im6]- |ker 6]. 
We have ker 6 = {a € E* | ca = a}, so by Galois theory, 
kerd = K*. 
By Hilbert’s Theorem 90 (see F9) we get 
imé = ker N. 


There follows |im NV | = | K*| and hence the desired conclusion K* =im JN. 


The fact stated in F10 is of considerable importance in algebra and in number 
theory. Here we derived F10 as a nice application of Hilbert’s Theorem 90, but it is 
worth remarking that another proof can be given: We know that the multiplicative 
group E%* of E is generated by some element ¢. Letting g denote the number of 
elements of K, we must show that the element Vz/x (¢) of K* has order g—1. This 
we do by noting that the Galois group G(E/K) is generated by the automorphism 
Og:at> at. Setting n = E: K, we obtain 


(44) Nex (6) = ceqee uf ca = gltata?ttg" 


Since € has order g” —1 = (q—1)(1+q+q?4+---+q"'), the element Nz/x (6) 
in (44) does indeed have order g — 1. 
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Binomial Equations 


1. Let K be a field. We consider polynomials of the form 
(1) S(X) =X" —y € K[X], 


with n EN. The roots of such a polynomial (in a splitting field E of f over K), that 
is, the solutions of the binomial equation X" — y = 0, are called n-th roots of y. 
(A binomial equation is also sometimes called a “pure equation’’.) 

In this chapter we will always assume y 4 0. The derivative of f(X) is f’(X) = 
nX"—!, Thus f(X) is separable if and only if char K is not a divisor of n. 


F1. Assume that the field K contains a primitive n-th root of unity (in particular, 
char K is not a divisor of n). Let y € K*. Then the Galois group G of the binomial 
equation X" — y = 0 over K is cyclic, and its order divides n. 

Moreover, if a is an n-th root of y in a splitting field E of X" —y over K, the or- 
der of G is the smallest natural number d such that a4 € K. The polynomial X4—a4 
is then the minimal polynomial of a over K. The extension E/K is generated by a. 


Proof. Let the notation be as above. In particular, let a € EF be a fixed root of 
X”"—y,so a" = y. Clearly, the elements a, where ¢ runs over all m-th roots of 
unity in K, are roots of ¥” —y in E, and there is no other. Thus EF = K(q). 

Now take o € G. Since @ is a root of X¥"” —y, so is oa; thus ow = fa fora 
unique root of unity ¢ = €(o). This defines an injective map 


(2) ot ~*~ =e) 
a 


from G to the group of n-th roots of unity. By assumption, all n-th roots of unity 
lie in K; this implies that (2) is a homomorphism, because by applying p € G 
to ow = F(a )a we get pow = F(a)pa = F(a) e(p)a, so S(po) = §(p)F(a). In 
summary, the Galois group G is isomorphic to a subgroup of the group of n-th 
roots of unity. The latter is cyclic of order 1, so G is cyclic and its order divides n. 
Let d be the order of G and o a generator of G. Then = oa/a is a primitive d-th 
root of unity. Since 


o(a“) =o(a)4 = e4q4 =a, 
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the Galois correspondence shows that w@ is an element of K. Conversely, if we 
assume that aw? € K, then 


at ae o(a?’) = o(a)” = ae 
so ¢4’ = 1. Thus d’ is divisible by d. 


There remains to show only that MiPox (aw) = ¥4 — a. On the one hand, a is 
a root of X4 — a4; on the other, deg MiPox (w) = K(a):K = E:K =|G| =d. 


Remarks. Still in the situation of Fl, let %/y denote an arbitrary n-th root of y in 
E. Also let 


(3) d= K(uy):K. 


By F1 we have n = rd for some r € N. The element 6 := (ay )4 lies in K, and 
we have y = f” and 


(4) K(Y/V) = K(V/B). 


Also it follows immediately from FI that the order d of the Galois group of 
K(2/y)/K coincides with the order of the element of K*/K*" determined by y. 


Theorem 1. Let E/K be a finite Galois extension with cyclic Galois group (we call 
E/K acyclic extension). Let n be the order of the Galois group G of E/K. If K 
contains a primitive n-th root of unity, E is obtained from K by adjoining an n-th 
root of an element of K; in other words, E = K(a), where a € E is a root of a 
polynomial X" —y € K[X]. 


Proof. Let K contain a primitive n-th root of unity ¢, and let o be a generator of 
G. We seek an element a € E* such that 


oa 
(5) —=6. 

a 
Such an a exists if and only if ¢ satisfies Nz;x(¢) = 1 (see Chapter 13, F9, alias 
Hilbert’s Theorem 90). This condition is satisfied since ¢ € K (so Nex (f) =" = 


1), so there is indeed a in E* such that ow = fa. From this we get 


a(a") =a(a)" = C"a" =a". 


Since o generates all of G, the power a” must already lie in K. Setting y := a”, 
we see that a@ is a root of the polynomial X¥” — y € K[X]. Also from (5) we get 


ola=Cla: 


thus a has n distinct conjugates a,fa,f7a,...,¢"-!a over K. There follows 
K(a):K =n=E:K, so E = K(qa). This concludes the proof. 
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Remark. In our proof of Hilbert’s Theorem 90 on page 140, we found an @ with 
the desired property by first setting 


n—-1 
(6) cay oa): 

i=0 
with 6B € E. The expression on the right is called the Lagrange resolvent and is 
sometimes denoted by (¢—'!, 8). For appropriate B ¢ E, as we have seen, 


(7) (E58) 0; 


so a = (€—!, B) is a primitive element of E/K whose n-th power is in K. Inciden- 
tally: if B € E satisfies (7), it is a primitive element of E/K; but the converse is 
not true in general. 


F2. Let g be a prime number and y an element of K*. Either y is a q-th power in 
K or the polynomial X4% — y is irreducible in K|X}. 


Proof. Suppose that y is not a g-power in K. Let E be a splitting field of X7 —y 
over K anda aroot of X7—y in E. 

If char K = q we have X79 —y = X4— a7 = (X — a@)!, so E = K(q) and 
E/K is purely inseparable (see for instance F16 in Chapter 7); then X% — y is the 
minimal polynomial of a over K (Chapter F15). 

So assume instead that char K #4 g. Then X4%—y is separable, hence FE contains 
a primitive g-th root of unity ¢. Suppose, for a contradiction, that X¥%—y is reducible 
over K; then it is a fortiori reducible over K(¢), and by F1 it must split into linear 
factors over K(¢), since g is prime. Thus 


K(a) ¢ K(¢). 


The extension K(¢)/K is Galois and its Galois group G is cyclic (being isomorphic 
to a subgroup of the cyclic group (Z/qgZ)*; see F9 and Theorem 2 in Chapter 9). 
Let o be a generator of G; then 


(8) oa = na, 


where 7 is a q-th root of unity, and in fact a primitive one (7 = | is excluded because 
it would imply that G fixes a, hence a € K, whereas we’ve assumed that y is not a 
q-th power in K). Since n ¢ K, we cannot deduce from (8) things like o/a@ = n/a, 
for instance. Instead we take the ratio 0(n)/n, which is also a primitive q-th root 
of unity, so that 


k 
n= (2) for some 0<k <q -1. 


0 
Together with (8) this leads to 
( a ) no na a 
o\— SS — eS SS ee 
Wee eye aay. <a 


As before this implies that a/n* lies in K. There follows (a/n*)? = a? = y, so y 
is a q-th power in K after all and we get the desired contradiction. 
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Remark. Still in the situation of F2, let K’ denote the field of g-th roots of unity 
over K. The proof we just gave actually shows something stronger than F2: Either 
y is q-th power in K, or X4 —y is irreducible even over K'. And we explicitly 
restate the following consequence of F1: If K’ = K, either X4% — y is irreducible in 
K[X] or X4 —y splits into linear factors over K. 


The statement of F2 enters crucially into the proof of the following general fact: 
Theorem 2. For y € K™, the polynomial X" — y is irreducible over K if and only if 
the following conditions are met: 

(a) There is no prime factor q of n such that y is a q-th power in K. 
(b) Ifn is divisible by 4, there isno X € K such that y = —4i4. 
Proof. Sufficiency. We work by induction on n. For n = | there is nothing to prove. 
Take n > 1 and assume (a) and (b). 
Let g be a prime factor of n and set m = n/q; also let a be a root of X"—y. By 


induction we can assume that X” — y is irreducible, so K(a?): K =m. Consider 
the polynomial X¥% — a? over K(a7). If this is irreducible, we have 


K(a): K = [K(a): K(a@’)]-[K(@?): KJ] =q-m=n, 


and therefore X” — y is irreducible over K, as desired. 

If, instead, X4 —a@? is reducible over K’ = K(a?%), we know from F2 that a is 
a q-th power in K’. We will analyze various subcases, in each one getting either a 
contradiction or the irreducibility of ¥” — y. Write 


(9) at = p41, with Be K’. 


Let N = Nx’/x be the norm map for the extension K’/K. Applying N to (9) we 
get 


(10) N(B)4 = N(B%) = N(@*4) =-y(-1)™, 


since X’ — y is the minimal polynomial of a? over K. 

If m is odd, (10) expresses y as a g-th power in K, contradicting (a). 

If m is even but g is odd, (10) says that (—N(f))? = y, so again y is a q-th 
power in K. 

Free as we are to choose qg, we have covered all cases except 


n=2°, withs>2. 
In this latter case (10) says that 
(11) —y=6*, withéeK. 


Suppose —1 = 7? in an extension of K; then i ¢ K, otherwise y would be a square 
in K, by (11). Over K(i) we have the decomposition 


X7 —y = (KX?! 4:18) (X73). 
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If the first factor on the right is irreducible over K(i), so is the second, since the two 
are conjugate; then the uniqueness of prime factorizations in K[X] and in K(Z)|X] 
(together with the fact that i ¢ K) yields the irreducibility of ¥?’ — y. 

Finally, suppose instead that 


HS 


is reducible over K(i). By the induction hypothesis, either 15 is a square in K(i) 
or i6 equals —4A* with A € K(i); in the latter case 15 is a square in K(i) all the 
same. Thus there exist A, 4 € K such that 


i6=(A+ip)? =)? 4 2api—p?. 
There follows 47 = ju? and 2Aju = 6. By squaring we get 5? = 4A* and then, taking 
(11) into account, y = —444, contradicting (b). 


Necessity. Let n = qm with q prime and m > 1, and suppose y = 6% in K. Then 
X41 —y is not irreducible over K, and so neither is (X”)? — y. That shows the 
necessity of (a). 

Let n = 4m with m > 1, and suppose y = —4A* with 4 € K. Then 


X4—y = X44 444 = (x? +207) —(20X)* 
(XP SO AOA Or hy 4: 


Thus X*— y is not irreducible over K, and so neither is (X’”)* — y. That takes 
care of (b). 


In Fl and Theorem | we had to assume that the ground field K contains a 
primitive n-th root of unity. When p = char K > 0 and p|n this condition cannot 
be met; but we can, at least in the case p = n, prove the following substitutes for 
Fl and Theorem 1: 


Theorem 3 (Artin-Schreier). Let K be a field of characteristic p > 0. 
(1) If E/K is acyclic extension of degree p, then E arises from K by adjoining a 
root of a polynomial of the form 


X?—-X-y, with ye K. 
(II) Conversely, given a polynomial of the form 
f(x) =X? -X—-ye KX], 


with splitting field E over K, either E = K or f is irreducible over K. In the 
latter case E/K is a cyclic extension of degree p. 
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Proof. We prove (II) first. Let a € E be a root of f(X). For any integer j > 0, 
regarded as an element of the prime field of K, we have 


f@tj)=@+j)?-@+j)-y =a? + j?-a-j-y 
=a? —a+ j?—j-y=fla)t+j?—j =0, 
where we have used that 7? — 7 =0. Thus f has in E the p distinct roots 
(12) a,at+l,a+2,...,a+(p-l); 


in particular, f is a separable polynomial, so the extension E/K is Galois. Moreover 
E = K(q). Thus, if one root of f lies in K, so do all others. Now assume that f 
has no root in K. Under any element o of the Galois group G of E/K, the image 
of a is a root of f as well, that is, 


oa=a+ jf, 
for a well defined j of the prime field of K. The map 
or oa-a 


is clearly an injective group homomorphism from the Galois group G into the ad- 
ditive group Z/pZ of the prime field of K. The latter is cyclic of prime order p. 
Since we assumed that E #4 K, we get GX Z/pZ. As claimed, then, E'/K is cyclic 
of degree p. Finally, f is irreducible because E = K(q). 


Conversely, to prove (I), let E/K be a cyclic extension of degree p, with Galois 
group G generated by o. Since char K = p, we have 


By F8 in Chapter 13, there exists a € FE such that 1 = ow — a, or yet 
(13) oa=a+t+l. 


In particular, we have oa # a; hence @ is not contained in K. This already implies 
that E = K(a), since E/K has degree p. From (13) we further obtain 


a(a? —a) =0(a)? —o(a) = (a +1)? — (a +1) 


=aP+1?-a-l=a?—-a. 


Hence the element y := a? —a is fixed by o and so must lie in K. Therefore a is 
a root of the polynomial ¥? — ¥ —y in K[X]. 


Remarks. (1) Let p be a fixed prime. If C is a field of characteristic p, consider 
the map 


(14) g:C—>C_ defined by (x) =x? —x. 


Kummer theory 149 
Theorem 3 shows that the solutions of the equation 
(a) =Y 


in fields of characteristic p play a role analogous to that of p-th roots in fields 
of characteristic 4 p. (See also §14.5 in the Appendix.) 


(2 


wm 


Is it also possible to say something about cyclic extensions E/K whose degree 
is a power of p = char K > 0? By “naive reasoning” only little; see §14.4 in 
the Appendix. But full information is provided by a subtle theory of E. Witt; 
see Section 26.5 in Volume II. 


2. Under the assumption that K contains a primitive n-th root of unity, we have 
given in Fl and F2 a description of the extensions E/K obtainable from K by 
adjoining an n-th root of an element of K; now we will investigate what happens 
when we adjoin not one, but a whole series of n-th roots of elements of K. But in 
order to do that we must go over some basic facts about finite abelian groups. 
First let G be any abelian group (written multiplicatively). Consider the set 


(15) G* = Hom(G, C*) 


of all homomorphisms x : G > C* from G into the multiplicative group C* of the 
field of complex numbers. This has the natural group structure, given by (pointwise) 
multiplication of functions; it is called the character group or dual of G, and its 
elements are called the characters of G. (We have already encountered an interesting 
example of a character, the Legendre symbol of Chapter 11.) 

We say that G is a group of exponent m if o” = 1 for every o in G. If G is of 
exponent m and y € G* is a character of G, the equalities y(a)” = x(o”) = x(1)=1 
show that the values of x are all m-th roots of unity, so x can also be viewed as a 
homomorphism 


(16) x:G > Whl(C) 


from G into the group W,,(C) of m-th roots of unity in C. 
If G = G, X G2 is a direct product, there is clearly a natural isomorphism 


(17) (G, x G2)* ~ GF x G3 

given by x (xlé,, Xe). 

F3. For every finite abelian group G there is a (noncanonical) isomorphism 
(18) G~G*. 


Proof. We use the fact (whose proof can be looked up in the next section) that G 
is a direct product of cyclic groups, G = C; x Cy x-+- x C,. In view of (17), it is 
enough to verify the existence of an isomorphism (18) in the case that G is cyclic. 
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So let G = (co), with o of order n = |G|. Then G ~ W,,(C), and we will be done 
if we show that the homomorphism 
G* > W,(C) 
x x(o) 
is an isomorphism. Injectivity is clear. Let ¢ denote a generator of W,(C), that is, 


a root of unity of order n in C. The equation x(a“) = ¢* then yields a well defined 
character x of G with x(a) = ¢; this proves surjectivity. 


F4. Let G, H be abelian groups and 
o:GxH->C* 


a bilinear (bimultiplicative) map, also known as a pairing. Then @ naturally gives 
rise to homomorphisms 


g1:G—> H* and ¢.: H > G* 
ot g(o,-) TH Q(-,T) 


If g is nondegenerate (equivalently, if p, and (2 are injective), G is finite if and only 
if H is; in this case g, and (2 are isomorphisms, and so in particular 


G~H* and H~G". 


Proof. Suppose ¢ is nondegenerate and H, say, is finite. Using F3 we successively 
get 
|G| < |H*| =|A| <|G*| = Gl, 


so G is also finite. The other assertions follow. 


FS. Let G be a finite abelian group. For every subgroup A of G the sequence 


Tres. 


(19) 1—> (G/A)* “4 Gt SS a* — 1 
is exact, that is, the image of each map equals the kernel of the next. 


Proof. For x € G*, the character res x by definition takes the same values as x 

on every a € A. For W € (G/A)*, the character inf y is defined by (inf y)(o) = 

w(o mod A) for every o € G. The exactness of (19) is then obvious everywhere 

except at A*, where it amounts to the surjectivity of res: G* — A*. But by the 

fundamental homomorphism theorem and F3 we can write 
|G*| |G| 


Ge gh, 
hee? | Seay g/l 
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F6. For any finite abelian group G the pairing 
G*xG > C* 
(X,0) > x(o) 
is nondegenerate, and so (by F4) it provides a natural isomorphism 
(20) G ~ (G*)* =: G™. 


Proof. If x(o) = 1 for every o we have x = | by definition. To show that our 
pairing is also nondegenerate in the second variable, let o be a nontrivial element 
of G; we must check that there exists x € G* such that x(a) # 1. 

Consider the subgroup A = (co) of G. By F3, A #1 implies A* # 1. Because 
of the surjectivity of res: G* — A* (see F5), there does exist some x € G* that is 
nontrivial on A; that is, the condition x(a) # 1 is satisfied. 


We now come to the previously announced field-theoretical applications of these 
results. We need: 


Notations and assumptions. Let K be a field, C an algebraic closure of K and n 
a natural number. We assume that K contains a primitive n-th root of unity. 

Given a subset A of K™*, we define the set of n-th roots of elements of A (in 
the chosen algebraic closure C of K) by 


VA ={aEC|a"e A}. 


Consider the extension E = K(*/A) in C, which is clearly Galois. If A, = (A) 
is the subgroup of K* generated by A, we obviously have K(/A) = K(*/A}). 
Thus from now on we may assume that A is a subgroup of K*. Denoting by K*” 
the subgroup of n-th powers of elements of K*, we can also assume that 


Kx? = A, 


if necessary after replacing A by the subgroup AK*” of K™. 


We speak of an abelian field extension L/K when L/K is Galois and its Galois 
group is abelian. 


F7. Suppose K contains a primitive n-th root of unity and let A be a subgroup of K* 
such that K*" © A. Then the extension K(</A)/K is abelian and its Galois group 
G is a group of exponent n. There is a canonical pairing 


Gx A> W,(K) 


21 oa 
ey (c,a)he —, wherea” =a. 
a 


This in turn gives rise to a nondegenerate pairing 


(22) Gx A/K*" —> W,(K). 
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K(VA)/K is finite if and only if the group A/K*" is finite, and in this case there 
are natural isomorphisms 


(23) G~(A/K*")*,  G*~ A/K*". 


Here, for an abelian group G of exponent n, the dual is considered to be the group 
G* = Hom(G, W,,(K)). Otherwise the isomorphisms in (23) are not canonical; 
indeed we have W,,(K) ~ W,(C), but not, in general, canonically. 


Proof of F7. We set E := K(4/A). For o € G and a € E% such that a” =aeA 
we have (o@)” = a, so there is a unique m-th root of unity € = €(0,a@) such that 
o 


(24) of 6 ay: 
a 


By assumption, ¢(0,q) is in K. If some a’ € E also satisfies a” = a" = a, there 
is 7 € W,,(K) such that a = na’; there follows 


a a a a’ 


oa _ a(na’) _ noa’) _ o(@’) 


Thus the map (21) is well defined. It is multiplicative in the second variable, since 
obviously 
o(ap) _ o(a) 018) 
op a B 
Since €(¢,a@) € K, applying some p € G to both sides of (24) immediately gives 
poa = C(0,a)pa = F(0,a)C(p,a)a, and hence 


$(po,a) = F(p,a) f(a, a). 


Thus (21) is multiplicative also in the first variable. Moreover we see that the 
Galois group G is abelian. Take o € G. If oa/a = 1 for any @ such that a” € A, 
we obviously get o = 1. Thus the pairing (21) is nondegenerate with respect to the 
first variable. On the other hand, let a” = a € A be given; then a necessary and 
sufficient condition for o~/a to equal | for all o € G is that w belong to K*, which 
is to say that a lie in K*”. Thus one gets from (21) the nondegenerate pairing (22). 
The remaining assertions now follow immediately from F4, with the replacement 
of C* by W,,(K). 


Remark. From F7 we recover F1 by taking A = (y) K*". 


Definition. A Galois extension E/K is called abelian of exponent n if GLE/K) is an 
abelian group of exponent 1. An abelian extension E'/K where K contains a prim- 
itive n-th root of unity is also called a Kummer extension, in honor of E. Kummer 
(1810-1893), who earned immortal fame for his profound work in number theory. 
Kummer himself actually considered only certain special extensions that arose in 
his number-theoretic studies; the more general, purely algebraic laws of what is now 
called Kummer theory came later. Their usefulness in number theory is astonishing, 
given how simple it is to derive them. 
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Theorem 4 (Kummer theory). Under the assumption that K has a primitive n-th 
root of unity, and keeping the earlier notations, the map 


(25) At K(VA) =: E4 


is a bijection between the set of subgroups A of K™ such that K*" © A and the set of 
subfields E of C for which E/K is abelian of exponent n. Moreover E'4/K is finite 
if and only if the group A/K*" is finite, and then there are natural isomorphisms 


(26) GLA KR™ -GEece. 


where G = G(E4/K) (and the convention flagged immediately after the statement 
of F7 is maintained). 


Proof. We must show that the map (25) is one-to-one and onto; everything else then 
follows from F7. 


Surjectivity: Let E/K be abelian of exponent n. Setting 
A:= EX" K* 


we get K(\/A) C E. Suppose that E is not contained in K(*/A) = E4. Then 
there exists a finite subextension F/K of E/K such that F Z E'4. This extension is 
abelian of exponent n because £'/K is. The finite abelian group G(F'/K) is a direct 
product of finite cyclic subgroups. By Galois theory, then, F/K is a composite of 
cyclic subextensions of F'/K; therefore we could just as well have chosen F' so that 
F/K it cyclic, and we assume we did so. By Theorem 1, we have F is generated 
over K by some a@ such that a” € K*. By definition, then, a belongs to E4. But 
then F C Ey, contradicting our assumption. 


Injectivity: In view of the preceding discussion, what is left to prove is: If EF = 
K(‘/A) for some subgroup A of K* containing K*”, then A must coincide with 
the subgroup 


(27) Ag = EX" K™. 


The inclusion A C Ag is clear. Conversely, let a be any element of Ag. Since 
K(4/4g ) = E= K(*VA), there exist elements a1,...,a, of A such that K(#/a) C 
K(2/qy,..., ®/ay ), which means that 


(28) K( Ya, Yar,..., Var) = K(V/ai,..., Yar). 


It suffices to prove that a is an element of the group A’ = (a1,...,a,, K*”"). To do 
this, consider the field 


E! = K(*/a,..., "/a,) = K(V4’). 


Because of (28), a€ Az’. We see therefore that the extension E/K may be assumed 
finite with loss of generality. But taking into account (23) in F7, we have 


A:K*"=ArRr:K*™"<o, 


from which we get the desired equality A = Ag because A C Ag. 
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Remark. Let E/K = K(4/A)/K be any Kummer extension of exponent n as above, 
with Galois group G. Then, even in the case E : K = oo, the nondegenerate pairing 
(22) gives rise to an isomorphism 


G ~ (A/K*")*, 


as can be deduced easily from the finite case, effecting the regress via projective 
limits. Similarly, (22) gives in the infinite case an isomorphism 


A/K*" ~ G*, 


if by G* we understand the group of continuous homomorphisms from the topolog- 
ical group G = G(E/K), with the Krull topology, into the discrete group W,(K). 


3. The preceding field-theoretic considerations are a good motivation for spending 
some time at this point on the structure of finite abelian groups. Although this is 
elementary material that you may have already studied, a brief and self-contained 
exposition of it geared toward our needs is useful. Actually the treatment of abelian 
groups is not at all out of place here; N. H. Abel (1802-1829), in his investigations 
of algebraic equations, made patent right from the beginning the close connections 
between equations and commutative groups. 


Definition. A group G is called a torsion group if for every x € G there exists n €N 
such that x” = 1. 

If p is a prime number, G is a p-group if for every x € G there is a p-power 
p* such that xP" = 1, 


F8. A finite group G is a p-group if and only if its order |G| is a p-power. 


Proof. Let n = |G|. We know that x” = 1 for every x € G. Thus, if 7 is a p-power, 
G is a p-group. If n is not a p-power, it has distinct prime factors g # p. By 
Cauchy’s Theorem (page 101) there exists x € G such that ord x = q. Therefore G 
is not a p-group; if it were, the order of x would be a factor of a p-power. 


Remark. In the case of an abelian group G we need not resort to Cauchy’s Theorem: 
Let G be an abelian p-group of order n > 1. For any x # 1 in G the order of x is 
a p-power p*, that is, the subgroup H = (x) has order p*. By induction we can 
assume that the order of the quotient group G/H is also a p-power. This gives the 
result, since |G| = |G/H|-|H|. 


Convention. In the sequel the group operation for an abelian group A will mostly 
be denoted additively. For x € A and n € Z we then write nx instead of x”. 


Remark. Every abelian group is a Z-module (and vice versa). The point of this 
observation is that the main results about to be stated for abelian groups carry over 
with only minor terminological changes to the context of modules over an arbitrary 
principal ideal domain R instead of Z. We will formulate them for abelian groups 
only, but give the proofs in such a way that they can be generalized without difficulty 
to modules over a PID. 
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Definition. Let A be any abelian group, and let p be a prime number. Set 
Ap = {x € A | there exists k € N such that p*x = 0}. 


Since A is abelian, A, is a subgroup of A. By definition, Ap is a p-group. We call 
Ap the p-component of A. 


F9. Every abelian torsion group A is the direct sum of its p-components; that is, the 
map 
9:@,4p >A 
(Xp)p > yp Xp 
is an isomorphism. 


Proof. (i) Clearly @ is a (well defined) homomorphism of groups. 


(ii) Take x = (Xp)p € ®, Ap, and suppose g(x) = 0. Let p,,..., p, be (distinct) 
indices such that xp = 0 for all p ¥ p;, and consider 


(29) n= pi ... pk with pe = ord Xp,. 

Set 1; = n/ pr for 1 <i <r. Because n,,...,n, are relatively prime, there is a 
relation 

(30) Cyn, +C2N2 +--+ +¢;ny = 1 with c; € Z. 


Set x; = Xp,;. Then 0=njg(x) =n; oy XE = Nix; and hence x; = 1x; = cjnixj =0 
for all 1 <i <r. Therefore x = 0, and we have shown that ¢ is injective. 


(iii) Take any y € A. Since A is a torsion group, there exists n € N such that 
(31) ny = 0. 


Let (29) be the prime factorization of 7, define numbers n; as in part (ii) and choose 
coefficients c; such that (30) holds. Then 


r r 
(32) yolp= >) anys dx, 
i=1 i=1 


with x; = cjnjy. Since DI xi = ci py niy = cjny = 0, each x; belongs to Ap,. 
Define x = (Xp)p € D, Ap by setting xp = x; if p = p; and xp = 0 otherwise; 
then g(x) = )° x; = y, by (32). Therefore g is surjective as well. 


Definition. Let G be a group. We say that G is a group of exponent m if 
(33) x" =1 forevery x EG 


(in the abelian case, mx = 0). 
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Remarks. (1) If G is a group of exponent m and m divides m’, then G is also a 
group of exponent m’. So our terminology is perhaps a bit questionable (but 
see the next definition). It is nonetheless very practical, particularly as regards 
the formulation of Kummer theory results such as the ones above. 


(2) If G is a finite group of order 1 we have x” = 1 for every x € G, so Gis a 
group of exponent n. 


Definition. Let G be a group, and suppose there exists m € N such that (33) holds. 
Among all such m let e be the smallest. We call this number the exponent of G and 
denote it by 

e(G) :=e. 


If no m EN satisfies (33), we set e(G) = 0. Clearly, 
G is finite = e(G)#40 = G isa torsion group 


(and neither implication is reversible, as the examples W(C) ~ Q/Z and RY show). 


F10. Let G be a group such that e(G) 4 0. Any m €N for which (33) holds must be 
a multiple of e(G): 


(34) e(G)|m. 


Thus e(G) is the least common multiple of the orders of elements of G: 


(35) e(G) =Icm {ord x | x € G}. 
In particular, 
(36) ord x|e(G) forallx eG. 


Proof. Set e = e(G) and write m = ge +r by applying division with remainder. 
For every x in G we have 1 = x” = xx", so x’ = 1. If r were not 0 we would 
get the contradiction e <r <e. This proves (34). 


Remarks. (i) If G is finite we have e(G) 4 0, and 
(37) e(G) divides |G]. 
(ii) For G = 83 we have e(G) = |G|. 


(iii) For any finite abelian group G we only have e(G) = |G| when G is cyclic. 
This quite nontrivial fact will become apparent as a consequence of F12 below. 


(iv) We show that (iii) easily implies the following theorem, first encountered in 
Section 9: If G is a finite subgroup of the multiplicative group K” of a field K, then 
G is cyclic. For let n be the order of G and e = e(G) its exponent. Every x € G 
satisfies x® = 1| and is thus a root of the polynomial X¥° — 1 € K[X]. There can be 
at most e such roots in K, son <e. There follows e =n, and then (iii) implies that 
G is cyclic. 
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(v) Consider the direct product G = [|;<; G; of given groups G;, for i € I. Set 
e = e(G) and e; = e(G;). Then obviously 


(38) e =Icm {e; |i € T}. 
(vi) Assume e(G) ¥ 0. If G is a p-group, e(G) is a p-power. 


F11. Let A be an abelian group with e(A) 4 0, and let pi,..., pr be the distinct 
prime factors of e := e(A). Setting Aj = Ap, and e; = e(A;), the map 


A,X ApX+::XApoAdA 
(39) 
ile Nae sag te TS 
is an isomorphism, and 
(40) e=e1e2...€r 


is the prime factorization of e. 


Proof: Ap is zero for every prime p that does not come into the prime factorization 
of e. By F9, then, (39) is indeed an isomorphism. The rest follows from remarks 
(v) and (vi) above. 


Remark. Let A be a finite abelian group of order n and let n = a ... Pr" be the 
prime factorization of n. Then p,..., py are exactly the distinct prime factors of 
e = e(A). The orders of the groups A; of F11 are given by 


(41) |Ail] = pj": 
in other words, A; is the p;-Sylow group of A, for each | <i <r. 


Proof. With the notation of F1l, we have A ~ A; x---x A;, so |A| = []j=, |Ail- 
Being a p;-group, A; has order a power of p; (see F8). The result follows, because 
the prime factorization of n is unique. 


F12. Let A be an abelian group with e(A) 4 0. There is an element a in A such that 
(42) ord a = e(A). 


Proof. (i) We consider first the special case where e := e(A) is a p-power for some 
prime p. For every x € G, the order of x divides e. Since e is a p-power, we have 


ord x = p’), 
with a well defined v(x) € N U0. Now, by (35), 
e= Iem{ p’ |x €A}= prartv@@)ixe4} 


Given a € A with v(a) = max{v(x)|x € A}, then, we have e = p”® = orda. 
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(ii) For the general case we invoke F11. Using the same notation as in that statement, 
we see from part (i) that, since e; = e(A;) is a power of p; for each i, there exist 
a; such that 


(43) orda;j =e; for 1<i<r. 

Consider the element a= (a,,...,d;) in A, X---xA,;. Since (39) is an isomorphism, 
it is enough to show that ord a = e(A). By (40) and (43) we have, on the one 
hand, e(A) = e1e2...e, = ord a, ordaz ... orda;; on the other, each element 
a = (a,...,@,) of the direct product A; x--- x A; obviously satisfies ord ad = 
Icm(ord a;,..., ord a;). But because all the a; have orders relatively prime to one 
another, we have Icm(ord a;,..., ord a,) = ord a; ord a2 ... ord dy. 


Definition. Let G be a group and / a subset of G. Denote by 
(M) 


the intersection of all subgroups of G containing M; equivalently, (/) is the least 
subgroup of G containing M. When M = {a,,...,a,} is a finite set we also write 
(a1,...,@r) for (M). 

When (M)=G we say that M generates G (or that the elements of M do so). A 
group G is called finitely generated if there are finite many elements a),...,a, €G 
such that G = (d1,...,a,). 


Let A be a finitely generated abelian group. By assumption, there are elements 
a,,...,a, in A such that 


r 
A= \ Ye cay lez} = Za,+---+Za, = (a,)+---+ (a;). 
i=1 
First we will assume that A is a torsion group. (Then A is necessarily finite; this 
is easy to check, but we will not make use of finiteness, so that our considerations 
can serve also for the more general case of modules over a PID.) Taking F11 into 
account, we can reduce to the case of a p-group. For that case we have: 


Theorem 5. Let A ¥ 0 be a finitely generated abelian p-group. Then A is a finite 
direct product of cyclic p-groups, that is, 


(44) AXZ/p"'xZ/p2 x-->xZ/p”’, 


where the v; are natural numbers. The numbers s,V1,V2,..., Vs are uniquely deter- 
mined if we impose the condition 


(45) Vp >>... Svs >. 


Proof. Suppose that A is generated by a,,...,a,, where r = r(A) is the minimal 
number of generators of A (i.e., the smallest cardinality of a set that generates A). 
To prove the theorem’s first assertion, we work by induction on r(A), and we show 
that (44) holds with s = r(A). 
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For r(A) = 1 everything is clear. Let r(A) > 1 and denote by e = e(A) the 
exponent of A. Clearly e #0. Since A is a p-group, one sees immediately that one 


at least of a,,...,@, must have order e; suppose without loss of generality that it 
is dy: 
(46) ord a, =e. 


Now consider A = _A/(a1), with quotient homomorphism x +> X. Obviously A= 
(G2,...,4,), so r(A) <r(A). By the induction hypothesis this means that 


(47) A = (bz) x+++ x (bs) 


for certain b; € A (and s <r). Moreover each 6; is replaceable by representative of 
the same class modulo (a1). By (47), we have in any case 


(48) A=Za,+Zb2.4+---+Zbs = (a1, b2,...,5s). 


(This means that r(A) < s, so in fact r = s.) We now show that one can choose 
the 5; in such a way that the sum (48) is direct, which will obviously imply the 
existence part of the theorem. Let e; = ord b;. Then we have 


(49) ejb} =kja,_ with k; € Z. 

Since e; = ord b; lord b;| e, we have e/e; € N. Multiplication of (49) by e/e; yields 
(e/e;)kja, = eb; = 0. 

Using (46) it follows that (e/e;)k; is divisible by e, and hence that k;/e; must be 

an integer. Now set b} = b; —(ki/e;)a1; then b; = b;, and e;b; = 0 because of (49). 


Therefore x _ 
ord bj | e; = ord bj = ord b; | ord 5}, 


so ord b; = e;. Thus we can assume without loss of generality that 
(50) ord bj =e; =ordb; for all 2<i <s. 


But in this case (48) is a direct sum, because an equality of the form 


S 
(51) Y > cibi =0, with c € Z (and by := a1) 
i=1 


s —_ — 
implies first )> cb; = 0 (since b; = 0) and then, using (47), 
i=2 
ord b; | ¢ for 2<i<s. 
Thus we conclude from (50) that ord ; divides c;, and hence 


cb, =0 for 2<i<s. 


But then c;4, also vanishes, so all summands in (51) are zero. 
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We must still show the uniqueness of the representation in Theorem 5. Assume that, 
besides (44) and (45), there holds 


AXZ/p™ xZ/p*? x-->xZ/p™, 


where [41 > 2 >... > (tr = 1 are natural numbers. Clearly p“! = e(A) = p”!, 
so certainly 4; = v;. We now apply induction on e(A). Suppose e(A) > p. The 
subgroup pA of A obviously satisfies 


pAXZ/p"'"|x---xZ/p’s Tn / ph! Pcs 7 pe, 


The induction hypothesis then implies immediately that v = (11,..., vs) and uw = 
(441,..., (44) already coincide at all entries except perhaps those of value 1; in other 
words, there exists d > 1 such that vg > 1 and that 


v=(y,...,¢g,1,...,]1), w=(1,...,v¥a,1,...,1). 
——” ——"’” 
nentries m entries 


To show that the number of 1’s is the same for jz and v, consider the subgroup 
A(p) = {ae A| pa=0} of A. We have 


A(p) = (Z/p)4 x (Z/p)™” ~ (Z/p)4 x (Z/p)”. 


This implies m =n, because we are talking of vector spaces over the field Z/p. 
The same argument also serves to start the induction, in the case e(A) = p!. 


(The last step of the proof can be simplified by considering the order of the 
group; but again we refrain from using this argument so the proof applies more 
generally.) 


Theorem 5’. Let A 4 0 be a finitely generated abelian torsion group (that is, a finite 
abelian group). Then A is a finite direct product of cyclic groups. More precisely, 
there exist natural numbers e€,@2,...,@s > 1 such that 


(52) ejsile; for l<jxs-l 

and that the isomorphism 

(53) Aw Z/e,xZ/erx---xZ/es 

holds. The numbers s,e1,...,@s are uniquely determined. 

Proof. If p1,..., pr are the distinct prime factors of e = e(A) ¥ 0, we have by F11 
(54) A=A,xX:::x A; with A; = Ap,. 

Now decompose the A; according to Theorem 5: 


Ss 
(55) Ai~ | [Z/p7%”, 
j=l 
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where we pad as needed with v(i, 7) = 0, in order to be able to have the same 
number s of factors for each 7. Multiply together the highest prime powers, one for 
each i, then the second highest, and so on. This yields a sequence 


(56) ej = pep ba 4s pvt) 


of natural numbers satisfying (52) and e; > 1. Now, because of (56), we have 


r 
Z/e; =~ [[Z/p7"”. 


i=1 


Thus by combining the factors in the same pattern as the prime powers, one obtains 
the isomorphism (53). 

The uniqueness part of Theorem 5’ is true because the A; = Ap, are canonically 
determined and their decompositions according to Theorem 5 are also unique (for 
each type). 


Theorem 5’ wholly explains the structure of finitely generated abelian groups 
that are torsion groups. To deal with the general case, we first look at the diametric 
opposite of torsion groups: 


Definition. A group G is called torsionfree if, apart from the identity, no element 
of G has finite order. 


F13. Let A be any abelian group. Then 
Ar ={x €A| there existsn € N such that nx = 0} 


is a subgroup of A. By definition, Ar is a torsion group. The quotient group A/Ar 
is torsionfree. 


The easy proof is left as an exercise. 
Theorem 6. [f A # 0 is a torsionfree finitely generated abelian group, then 
AxZx:xZ=T’, 
where r := r(A) is the minimal number of generators of A. 


Proof. We wish to show that A is free abelian in r generators, which by definition 
means that A ~ Z”. We first establish that A is isomorphic to a subgroup of Z”, then 
show that any such subgroup is isomorphic to Z% for some d <r. That settles the 
question, since r < d as well (obviously Z@ has a generating set with d elements). 

Suppose A = Za; + Za, +---+Za,. After renumbering, let {a;,...,a,} be 


a maximal linearly independent subset of {a1,...,a,}. (Linearly independent here 
means that if }° cja; = 0 with c; € Z then cy = cp =--- = Cy = 0.) In view of the 
definition of a,,...,dy, there is for each i with n <i <r arelation of the form 


n 
MjQqj = ) cjaj for some nonzero m; € Z. 
j=l 
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Taking the lcm we see that there exists m € Z \ {0} such that 
ma; € Za,+-+:+Za,_ for every 7. 


Setting F := Za; +---+ Zan, we rewrite this as 


(57) mAC F. 
Now F, by its definition and the linear independence of a1,...,dn, is free abelian 
inn generators: an isomorphism F ~ Z” is given by (x1,...,Xn) >. x;q;. At the 


same time A and mA are isomorphic (by the map a+ ma, since A is torsionfree). 
So, as announced, 


(58) AxmACFrZ" withn<r. 


Our theorem will be completely proved after we show that any subgroup of Z” is 
isomorphic to Z4 for some d <n (where we allow d = 0 to represent the zero 
group). 

We do this by induction on n. The case n = | is clear: every nonzero subgroup 
of Z is of the form mZ ~ Z. Suppose n > 1 and denote by a: Z” — Z the 
projection onto the n-th factor: m(x1,...,Xn) = Xn. Let B C Z” be our subgroup 
and take B‘ = BNker 7; this is isomorphic to a subgroup of Z”~!, so the induction 
assumption implies that 


Bi’~Z*, with k<n-1. 
If B = B’ we are done. If instead B ¥ B’, it is easy to check that 
(59) B= B'xZa, 


where a € B is such that sra generates the subgroup 7B. Thus B ~ Z*xZ ~ 7" 
with k + 1 <n, and our assertion is proved. 


Remarks. (1) Should anyone need convincing that Z” and Z” can only be iso- 
morphic if 7 =n, here is one reason: An isomorphism between Z” and Z” 
implies, for any prime p, an isomorphism between pZ” and pZ”, and so also 
an isomorphism between 2” /pZ™ ~ (Z/p)™ and Z"/pZ" ~ (Z/p)". But 
then m =n follows by comparing dimensions. 

(2 


wa 


The existence of the direct-sum decomposition (59) falls under the following 
principle: Given an exact sequence 


(+B 4A—S cS 0 


of abelian groups, if there is a homomorphism o :C — A such that moo = idc, 

then A is the direct sum of iB and oC, and hence A= BxC. The simple 

proof is left to the reader; note that the endomorphism ¢:= 007 of A satisfies 
2 

& = 6&, 
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Theorem 7 (Classification of finitely generated abelian groups). Let A be a finitely 
generated abelian group. There exist integers C,,C2,...,Cs = 0, all distinct and 
different from 1, satisfying 


(60) cjloj41 forl<j<s-l, 


and such that 


(61) A2rZ/e,xZ/crx-+++xZ/Cs. 
The numbers s,c,,...,Cs are uniquely determined; c1,...,Cs are called the invari- 
ants of A. 


Proof. Let Ar be the torsion component of A and set A = A/Ar (see F13). Being 
a homomorphic image of A, the group A is finitely generated. It is also torsionfree, 
so Theorem 6 shows that 


A~Z’=Z/0x:--xZ/0 


for some r > 0 (with the obvious interpretation in the case r = 0). Denote by 
qt: a —> @ the canonical homomorphism from A to A, and let @,,...,d@, form a Z- 
basis of A. There exists exactly one homomorphism o : A > A such that 0 (G;) = aj 
for all 1 <i <r; it satisfies 7 oo =idz. By Remark 2 after Theorem 6, then, 


AxArxA: 


in particular Ay is also a homomorphic image of A, and so finitely generated. 
By Theorem 5’, then, there is a (possibly empty) family of natural numbers 
C1,C2,...,€g > 1 such that 


Ap ~Z/eyx-++xZ/cq and cj|cj44. 
Setting cg41 =-++: = Cg+r = 0, we obtain (60) and (61). 


The uniqueness part, which is very important, comes out easily from the uniqueness 
statements in Theorem 5’ and Remark | to Theorem 6. 


Remark. Still in the situation of Theorem 7, the number s equals r(A), the minimal 
number of generators of A. For obviously r(A) < 5s. Suppose s = 1. If p is a prime 
divisor of c; we have A/pA ~ (Z/p)*, sos =r(A/pA) < r(A). 


In the framework of the question we are pursuing, Theorem 7 represents a fully 
satisfying outcome (and one which, as mentioned, extends to modules over any 
principal ideal domain R instead of Z). In some other situations, however, one 
often needs a stronger result: 


Theorem 8. Let F ~ R” be a free module in n generators over a principal ideal 
domain R, and let N be a submodule of F. There exists a basis by,...,bn of 
F and elements c,,...,Cm (where m <n) in R ~ {0} such that cj|cj+1 and that 
c1b1,...,Cmbm form a basis of N. The number m and the elements c1,...,Cm are 
uniquely determined apart from multiplication by units. 
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Proof. If we take existence as proved, we get an isomorphism 
F/N ~ R/cy X+++X R/¢m x R"™; 


then uniqueness follows immediately from Theorem 7 (but note that here the early 
cj’s can be units in R). 

One might think that the existence part can also be plucked somehow from 
Theorem 7, but that impression is deceptive. We are better off resorting to the 
methods of linear algebra. Both R-modules, F and N, possess R-bases, one with 
n and the other with m elements, where m <n. (Regarding N see the discussion 
following (58) on page 162.) The assertion now follows directly from a well known 
result: 


Invariant Factor Theorem. Let A be ann xm matrix over a principal ideal domain 
R, with m <n. Then A is equivalent over R to a matrix obtained from anim xm 
diagonal matrix by adding n—m rows of zeros and where the diagonal entries c1,..., 
Cm satisfy Ci | Ci+1- 


This is proved in LA II, p. 148, in the case where R is a Euclidean ring (also 
with the assumption m =n, but that makes scant difference.) If R is a PID but not 
Euclidean, it’s not possible to get by with only the usual elementary operations, and 
the reader might ponder what to do instead. Note that certain other manipulations 
of two rows or columns are also permissible, whereby the leading entries a, b are 
replaced by d,0, with d = gcd(a, db). 


Apart from uniqueness, Theorem 7 is an easy consequence of Theorem 8: Let 
M bea finite generated module over a principal ideal domain R and n the minimal 
number of generators of M/. Then M ~ R"/N. Theorem 8 gives 


M ~ R/cy X+++x R/cm x R™™. 


By the minimality of 1, none of the c; is a unit in R. Hence we get a decomposition 
of the desired form, setting c; := 0 form <i <n. 
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Solvability of Equations 


1. Let K be a field. The roots of a quadratic polynomial 
(1) {(X)=X* + pX +4 


can be represented, in a splitting field E of f over K, as 
2 
(2) -£ +VJd, where d= (2) —4; 


here Vd denotes an element of E whose square is d, and we have assumed that 
char K 4 2.' Moreover, 


(3) 4d = p*—4q 


is the discriminant of f in the sense of Section 8.3. 


In the early sixteenth century Italian mathematicians (Scipione Ferro, Nicolo 
Tartaglia, Girolamo Cardano) found out that the roots of a real cubic polynomial 


(4) f(X) = X37 4+ pX+¢q 


too can be expressed using appropriate radicals: namely, in the form 


9 POE + HVO+e 


where the cube roots must be suitably interpreted (see F10). The same formula finds 
the roots of f € K[X] for any field K, provided of course that char K 4 2, 3. Under 
this assumption, moreover, any cubic polynomial g(X) = X¥3+aX7+bX +c can be 
reduced to a polynomial of the form (4), via the transformation f(X):= g(X — za). 


! If char K = 2, the polynomial / in (1) is inseparable if and only if p = 0. If this is the case, 
J has as its only zero the unique square root of q. If p 4 0, the roots of f can be expressed 
in terms of a solution a of the equation Y* — Y = q/p?: they are pa and p(a+1). 
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The question whether similar formulas exist for the roots of polynomials of any 
degree occupied mathematicians for a long time. It was finally answered in the 
negative by Abel in 1826, and it provided a decisive impetus for the work of Galois 
(1831). 

This problem of solvability of equations by means of radicals is close in spirit 
to the problem of constructibility with ruler and compass, discussed in Chapter 1. 
We will proceed in a similar way as we did then, with the difference that we now 
have at our fingertips a well-developed conceptual apparatus. 


2. First we have to make precise what is meant by solving an equation by radicals. 


Definition 1. (i) Let F/K be a field extension. We say that F arises from K by 
the successive adjunction of radicals (of exponents nj,...,Ny respectively) if 
there is a finite chain 


(6) K=Ky CK, G:C K, =F 


of intermediate fields K; of F/K such that each K; is obtained from K;_; by 
the adjunction of an 7;-th root (also called a radical of exponent n;). In this 
case we call F'/K a radical extension. 


(ii) An extension E/K is called solvable by radicals (of exponents n1,...,Nr 
successively) if there is a field extension F/K such that E C F and F arises 
from K by the successive adjunction of radicals (of exponents 11,...,, 
respectively). 


(iii) A polynomial f(X) € K[X] is called solvable by radicals (of exponents n1, 
...,My successively) if there is an extension E'/K that is solvable by radicals 
(of exponents n1,...,Ny successively) and such that f splits into linear factors 
over E. 


Remarks. (a) Obviously, any radical extension can be obtained by successively 
adjoining radicals of prime exponents. 
(b) Let F,, F, be intermediate fields of an extension C/K and let F; Fy be their 
composite in C. Clearly, if F/K is a radical extension, so is F F>/F 2; and if 
F\ F,/F and F,/K are radical extensions, so is F, F,/K. Putting it together 
we see that if F,/K and F,/K are both radical extensions, so is Fy F2/K. 
Corresponding statements follow for extensions solvable by radicals. 


(c 


Ne 


If F/K is a radical extension and F” is the normal closure of F/K, the exten- 
sion F’/K is also radical. This follows from (b), because F” is the composite 
over K of the fields conjugate to F (inside some algebraic closure C of F). 
The corresponding statement for extensions solvable by radicals follows. 


(d) From (c) we derive: If f € K[X] is irreducible and E/K is an extension 
solvable by radicals such that f has a root in EF, then f is solvable by radicals. 


(e) Any finite and purely separable extension E'/K is a radical extension; see F15 
in Chapter 7. 
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Next we consider a radical extension F'/K, assumed to be Galois, and we inves- 
tigate what consequences can be derived regarding the Galois group G = G(F/K). 
Take a chain of subfields (6) with all terms different and such that the adjoined 
radicals all have prime exponent; that is, assume that 


(7) K; = Ki-1 (aj) with a; ¢ Kj-1, aPi € Kj-1, pi prime 


for alli. Then p; #char K for alli; otherwise K;/K;—1 would be purely inseparable 
of degree p;, contradicting the separability of F'/K (see F3 in Section 7). If we set 


N= Pi pr... Dr, 


there is a primitive n-th root of unity ¢ in the algebraic closure C of F. We now 
make the additional assumption that 


CeEK. 
Then, by F1 in Chapter 14, each extension 
(8) K;/Ki-1 is cyclic of degree pj. 


Now, by Galois theory, the chain (6) of intermediate fields of the extension F/K 
corresponds to a chain 


G=HW2H,2>::-DH,=1 


of subgroups H; = G(F/K;i) of G = G(F/K), and because of (8), each H; is a 
normal subgroup of H;—1, with quotient group H;—,/H; cyclic of order p;. Thus 
we are led to: 


Definition 2. A group H is called solvable (or metacyclic) if there exists a chain 
0) Cee tet eee ae 
of subgroups H; of G such that 
(10) H; < H;-, and Hj-,/H; is cyclic of prime order. 
(The Hj; need not be normal subgroups of G.) 
We next assemble a list of simple facts about solvable groups: 


F1. Let G be a finite group. 
(a) If G is solvable, so is any subgroup of G. 
(b) If G is solvable, so is any quotient group G/N of G. 
(c) Suppose N 1G. If N and G/N are solvable, so is G. 
(d) If G is abelian, G is solvable. 
(e) If G is a p-group, G is solvable. 
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We postpone the proof for a bit, so we can get to something meatier right away: 


Theorem 1. /f E/K is an extension solvable by radicals and E' is the normal closure 
of E/K, the automorphism group G(E'/K) is solvable. 


Proof. By Remark (c) following Definition 1, E’/K is solvable by radicals if E/K 
is. Thus we may as well assume that E/K is normal: E = E’. By assumption 
there is a radical extension F'/K such that E is a subfield of F. Again by Remark 
(c) we can assume that F'/K is normal. Now, the natural map G(F'/K) > G(E/K) 
is surjective; thus by part (b) of F1, it suffices to show that G(F/K) is solvable. 
Altogether this shows that we can assume that E'/K is both a radical and a normal 
extension; we further assume that F arises from K by the successive adjunction of 
radicals of prime exponents p1, P2,..., Pr. 


Now let 7 be the product of all the p; such that p; 4 char K, and fix a primitive 
n-th root of unity ¢ (in the algebraic closure of E). Consider the diagram of fields 


E(S) 


E 4 % 
Ss al 


Clearly, E(¢)/K is normal. Moreover E(6)/K(¢) is a radical extension with 
exponents pi,..., pr, since E/K is one. By F9 in Chapter 9, the group G(K(¢) /K) 
is abelian, and thus solvable, by part (d) of Fl. Thus, if we take the solvability of 
G(E(¢)/K(6)) as granted, part (c) of Fl shows that G(E(¢) /K) is solvable and thus 
also G(E/K), by part (b) of F1. What is left to prove, then, is that G(E'(¢) /K(6)) is 
solvable, or, otherwise put, that G(E'/K) is solvable under the additional assumption 
that Ce K. 


(1) 


If E/K is Galois, there is nothing left to show, in view of what we said before 
Definition 2. The general case is completed by induction on r. For r = 0 there is 
nothing to show. Thus take r > 0 and assume without loss of generality that Ki # K. 
If p; =char K, the extension K,/K is purely inseparable, so G(K1/K) = 1; if pi ~ 
char K, on the other hand, G(K;/K) is solvable. The induction hypothesis applied 
to the normal radical extension £/K, says that G(E/K1) is solvable. Because 
G(K,/K) and G(£/K,) are solvable, so is G(E/K), by part (c) of FI. 


The reader is encouraged to work out the proof again under the assumption that 
char K = 0, which makes it less fussy. 


Proof of F1. (b) Let 1 : G — G be a surjective homomorphism. Given a chain (9) 
of subgroups of G, where each H; is a normal subgroup of H;—,, the application 
of a yields a chain 


G=nG=nH) 2a, 2-2 aH, =1, 
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where again each mH; is a normal subgroup of 7H;-;. Moreover z gives rise to 
surjective homomorphisms 


wy: Ay-1/H; > wHj-1/1 Hj. 


If H;-,/H; has prime order, the Fundamental Homomorphism Theorem for groups 
and the Euler-Lagrange Theorem imply that either 7 H;~; = 2H; or 2 Hj-\/n H; ~ 
H;—,/H;. This proves that G is solvable. 


(c) We can start with a chain 
G/N =U) DU, D::-DUy = 1 


of subgroups G/N, where each U; is a normal subgroup of Uj—1 of prime index, 
and a similar chain 


N = Hm 2> Hm412...2 Hy =1 


for N. Let 1: G— G/N be the quotient homomorphism, and consider the sub- 
groups z~!(U;) of G. We have 1~!(Um) = 2~!(1) = N = Hy, so there is no 
ambiguity in notation if we set 


ca a! (U;) for i=0,1,...,m. 
Of course H; is a normal subgroup of H;—-1;; moreover z yields an isomorphism 


Ay) /H; ~ Uj-1/ Uj. 


Thus we obtain a chain 


G = Hy 2 MH, 2::- > Hm > Hm+1 2+: 2> An =1, 


where each H; is a normal subgroup of Hj;—; with quotient group H;—,/Hj; cyclic 
of prime order. Therefore G is solvable. 


(d) Let G be a finite abelian group. We prove that G is solvable by induction on 
the order of G. Leaving aside the trivial case G # 1, take an element o € G whose 
order is a prime p. Let N be the subgroup generated by o. Since G is abelian, N is 
trivially normal in G. By the induction assumption, G/N is solvable. Since N too 
is solvable, being a group of prime order, part (c) above shows that G is solvable. 


Moreover using (d) and (c) we see by induction that, if a finite group G admits a 
chain of subgroups 
G=HAoMC::-bHA,=1 


for which all the quotient groups Hj—1/Hj are abelian, G is solvable. Thus we can 
forgo the stipulation that the H;_,/H; in (10) have prime order. On the other hand, 
we remark that any solvable group G admits a chain G = No D Nj D--- DN, = 1 
where each Nj is a normal subgroup of G, so that the quotient groups Nj—1/N; are 
all abelian (see $15.2 in the Appendix). 
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(e) The solvability of p-groups is guaranteed by the (stronger) statement of F8 in 
Chapter 10, again taking into account that a group of prime order is automatically 
cyclic. 


(a) Let H be a subgroup of G. Starting from a chain (9) of subgroups of G having 
the properties listed in Definition 2, we obtain by intersection with H a chain 


H=A)NHADMNAD::-2DANH=1. 


For each i = 1,2,...,7, consider the restriction 2; of the quotient homomorphism 
H;_, > H;_,/H; to the subgroup Hj_-,N HA of Hj_,: 


mj: Hj H > Hj-1/H;j. 


Its kernel is (Aj-10 H) 1 A; = Hj; H. Thus AH; OM FA is a normal subgroup of 
Aj—, 0 H, and x; yields an injective homomorphism 


A;-,N H/HjiNH > Ay, /H;. 


Thus, either H;_; M H coincides with H;N H or Hj-;QNH/H;NH ~ H;_,/H; is 
cyclic of prime order. The solvability H follows. 


We take the opportunity afforded by the preceding considerations to formulate 
the relevant isomorphism theorem: 


F2 (Noether’s isomorphism theorem for groups). Let N be a normal subgroup of 
a group G, with quotient map xn: G > G/N. For every subgroup H of G, the 
restriction of x to H gives rise to an isomorphism 


H/HON ~ x(H): 


moreover m~'(1(H)) = HN, so x also gives rise to an isomorphism HN/N ~ 
(HH). Altogether one gets a natural isomorphism 


(12) HN/N ~ H/HON. 


Getting back to the problem of departure: we now show that, with the right 
stipulation regarding the characteristic, Theorem | admits a converse. (The reader 
might consider first the case char K = 0, where certain technical complications do 
not arise.) 


Theorem 2. Let E/K be a finite field extension and let E' be the normal closure 
of E/K. If the group G(E'/K) is solvable and its order is not divisible by the 
characteristic of K, the extension E'/K is solvable by radicals. 


Proof. Clearly we may as well assume that E/K is normal (E' = E). Let Es be the 
separable closure of K in E. The extension E/E, is radical (see Remark (e) after 
Definition 1), so it suffices to show that E/K is solvable by radicals. However, 
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E;/K is Galois (because the normality of E/K obviously implies that of E,/K); 
since G(E;/K) ~ G(E/K) we can therefore assume that £/K is Galois. 

Let G be the Galois group of E/K, of order n = E: K. Since char K does not 
divide n, the algebraic closure C of E contains a primitive n-th root of unity ¢. Now 
consider again the diagram (11). Trivially, K(¢)/K is a radical extension, since ¢ is 
an n-th root of 1. Therefore it is enough to show that E(¢) /K(€) is solvable by rad- 
icals (because then so is £(¢)/K, and a fortiori E/K). By the Translation Theorem 
of Galois theory (Chapter 12, Theorem 1), the group G(E(¢) /K(¢)) is isomorphic 
to a subgroup of G = G(E/K); and since G is solvable, so is G(E(f)/K()), by 
F1. Thus we see that we can assume without loss of generality that the ground field 
K contains a primitive n-th root of unity. 

That G = G(E/K) is solvable means there is a chain 


G=MHW2H,2::-DH,=1 


of subgroups H; of G such that each H; (where | <i <1) is a normal subgroup of 
Hj—1, with cyclic quotient Hj—1/H; of prime order p;. By Galois theory we have 
an associated chain 

K=KyOK\C--CK,=E 


of intermediate fields K; of E/K, where each extension K;/K;_, (for 1 <i <r) 
is Galois, with Galois group isomorphic to Hj—1/H;. Thus 
(13) G(K;i/Ki-1) 


is cyclic of prime order p;. Since K;—1 contains a primitive p;-th root of unity 
(since p; divides n), Theorem 1 of Chapter 14 says that K; arises from K;-; by 
adjunction of a p;-th root. This completes the proof of Theorem 2. 


Remark 1. We can extend the notion of a radical extension given in Definition | 
(and likewise that of an extension solvable by radicals) by allowing the possibility 
that, if char K = p > 0, some elements Kj; in the chain (6) be obtained from K;_, 
by the adjunction of a root of a polynomial of the form X¥? — X¥ —y € Kj-,[X] 
(compare Theorem 3 in Chapter 14). Then the statement of Theorem | remains valid, 
and Theorem 2 is valid without any condition on the characteristic. The proofs are 
analogous to the ones given earlier, with recourse to Theorem 3 of Chapter 14 when 
required. 


Remark 2. Let F/K be a field extension. We will say that F arises from K by the 
successive adjunction of irreducible radicals of prime exponents p,,..., pr if there 
is a finite chain 

K=Ko)0CKiC::-CK,=F 


of intermediate fields K; of F/K such that each K; arises from K;_; by the ad- 
junction of a root of an irreducible polynomial of the form 


X?i —y; € Kj-,[X], with p; prime. 
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An extension E/K is called solvable by irreducible radicals if E is a subfield of 
a field F that can be obtained from K by the successive adjunction of irreducible 
radicals. 

If E/K is such an extension and E’ is the normal closure of E/K, the extension 
G(E'/K) is solvable, by Theorem 1. For the converse we have to strengthen a bit 
our assumption on the ground field K: 


Theorem 2’. Let E/K be a finite extension, with normal closure E'. If the group 
G(E'/K) is solvable and the characteristic of K is either zero or greater than all 
primes dividing the order of G(E’/K), the extension E/K is solvable by irreducible 
radicals. 


The proof follows the same pattern as that of Theorem 2: First we can assume 
that E'/K is normal. Since any purely inseparable extension is always solvable by 
irreducible radicals, we can also assume that E/K is Galois. Now everything goes 
through as in the proof of Theorem 2, assuming the following result to have been 
proved: 


Lemma 1. Let K be a field and n a natural number. Let K' be the field of n-th 
roots of unity over K. If the characteristic of K is either 0 or greater than all prime 
factors of n, the extension K'/K is solvable by irreducible radicals. 


Proof. We work by induction on n. The cases n = 1, 2 being trivial, assume n > 2. 
There is a primitive g(n)-th root of unity 7 in the algebraic closure C of K’, because 
no prime factor of y(m) is greater than all prime factors of n (formula (29) in 
Chapter 9). Now consider the diagram 


K'(n) 


a, 
of 


Since y(n) <n, the extension K(n) /K is solvable by irreducible radicals, by the 
induction hypothesis. We thus see that we can henceforth assume that the ground 
field K contains a primitive y(7)-th root of unity. Now, G(K’/K) is isomorphic to 
a subgroup of the group of prime residue classes modulo m (Chapter 9, F9). Thus 
G = G(K'/K) is certainly abelian, and its order divides y(n). Every prime factor 
q of |G| is therefore a factor of y(m). Since abelian groups are solvable, G has a 
chain of subgroups 


K(n) 


G=H2> HM, 2-:->H,=1 


where each H; (for 1 <i <7) is anormal subgroup of Hj; or prime index qg;. By 
the Galois correspondence we get a corresponding chain 


K=KyOK\C--C K,=K’ 
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of intermediate fields K; of K’/K, where each extension K;/Kj—, is Galois with 
Galois group of prime order qg;. Each q; divides y(n); thus K and therefore also 
K;-, contain a primitive q;-th root of unity. By Theorem | in Chapter 14, the field 
K; arises from K;—, by the adjunction of an irreducible radical. Therefore K’ arises 
from K by the adjunction of irreducible radicals. 


Let f € K[X] be a polynomial over a field K, and let FE be a splitting field of 
f over K. The automorphism group G = G(E/K) is also called the group of f 
(or of the equation f = 0) over K. If all prime factors of f are separable in K[X], 
the extension E/K is Galois; in this case G is called the Galois group of f over K 
(compare Chapter 8, Definition 3). An arbitrary polynomial f € K[X] is solvable 
by radicals if and only if E/K is solvable by radicals; see Definition 1(iii). With 
this, Theorems | and 2 imply: 


Theorem 3. Let f be a polynomial over a field K. If f is solvable by radicals, the 
group of f over K is solvable. Conversely, if the group of f over K is solvable and 
its order is not a multiple of the characteristic of K, the polynomial f is solvable by 
radicals. 


Remarks. (1) In characteristic 0, Theorem 3 becomes: A polynomial f € K[X] 
is solvable by radicals if and only if its group over K is solvable. 

(2) If the characteristic of K is 0 or greater than all prime factors of |G|, Theorems 
3 and 2’ together imply: A polynomial f € K[X] is solvable by radicals if and 
only if it is solvable by irreducible radicals. 

(3) If we use the extended notion of radicals introduced in Remark | to Theorem 2 
(namely allowing roots of polynomials X? — X —y where p =char K > 0), the 
italicized statements in parts (1) and (2) hold regardless of the characteristic. 


3. The foregoing considerations raise two obvious questions: 
A. Are there any finite groups that are not solvable? 
B. What finite groups occur as Galois groups of Galois extensions? 

We first take up question B and show that, for any n, the symmetric group Sn 
is the Galois group of an appropriate Galois extension. To this effect we consider, 
over some arbitrary ground field k, the field k(X1, X2,..., Xn) of rational functions 
in the n variables X,,..., Xn (that is, the fraction field of the polynomial ring 
k[X1,..., Xp] in the n variables X1,..., X, over k). For every rE k(X1,..., Xn) 
there exist polynomials g1, g2 € k[X,..., X,] such that 


— 21 
(14) fem ae 


Of course this representation is not unique, but we have 


(15) peels th < gih2 =hygo. 
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Take @1,...,@, in some extension k’ of k. The expression 
1,...,@ 
(16) is egies 
2(4, Sie An) 


is well-defined when it is defined at all, that is, when in the representation (14) we 
can choose gy so that g(a@1,...,@,) # 0. This can be easily seen from (15). 
We now study the polynomial 


n 
(If) =T[ X= ¥1) = XH XI $9 XH CD" 
i=1 
over k[X,,..., Xn] and over k(X1,..., X,). Note that s; € k[X1,..., X,], so we 
write sj = sj(X1,..., Xn). We call s; the i-th elementary symmetric function. We 


have 
Sp =Xy~+ Xo+--- +X, 


So = XM X_ +0 + XM Xn + + Xn-1 Xn, 


Sy = X1,X2...Xn. 


The symmetric group S, acts in a natural way on k[X1,..., Xn] and k(X1,..., Xn): 
Given o € Sy, we set 


oh(X), ate Xn) = h(Xoa). Xo2), eis ey 


that is, the map hh oh of k[X1,..., Xn] into itself is the unique homomorphism 
of k-algebras 


(18) kK[X1,...,Xn] > k[M1,...,Xn] such that X; H Xo). 


It is clearly an isomorphism. Next, the map h & oh on k(Xj,..., Xn) is the 
unique extension of (18) to the fraction field. In this way each o € S;, defines a 
k-automorphism of k(X1,..., Xn). In the sequel we will regard S, as a subgroup 
of the automorphism group of the extension k(X1,..., Xn) /k: 


Sy © Aut(k(X1,...,Xn)/k) © G(k(X1,...,Xn)/k) 


(see page 66 for notation). For conciseness we set F = k(X1,..., Xn). Then 
the elements o € S, act (coefficientwise) on the ring FLX] of polynomials in one 
variable X over F. The polynomial f in (17) satisfies 


n n n 
of =] ][(¥-0(%) =[]¥-Xw) =] ]X-X =F. 
i=1 i=1 i=1 
Thus the coefficients s; of f lie in the fixed field of S,, ink(X1,..., X,). But clearly 
k(X,..., Xn) is a splitting field of the separable polynomial f(X) = [];(X —X7) 
over the field 
K(81,82,---55n)- 


Every finite group is a Galois group. Symmetric functions 175 


Therefore k(X1,..., Xn)/k(S1,...,5n) is a Galois extension. Every automorphism 
of this extension permutes the roots X;,...,X, of f, and as just seen every per- 
mutation of X;,..., X, is obtained in this way. Thus 

(19) G(K(X1,...,Xn)/k(s1,...,5n)) = Sn. 


With this we have exhibited S,, as the Galois group of a field extension. 


Definition 3. An element r € k(X,,..., X») is called symmetric if 
or=r _ forall o € Sy, 


that is, r =r(X1,..., X») remains unchanged by any permutation of the variables. 


Thus the symmetric functions of k(X,...,X,) are precisely the elements of 
the fixed field of S, in k(X1,..., X;). Taking (19) into account we see by Galois 
theory (Chapter 8) that the field of symmetric functions of k(X1,..., Xn) coincides 
with the field k(s1,...,5,). To summarize: 


F3. Consider over the field of rational functions k(X,,..., Xn) the polynomial 
n 
T(X)= [ [@-%) =x" =, x"! + 5yX"? +--+ (-1)"5q 
i=1 


whose coefficients $1,..., Sn are the elementary symmetric functions in X1,..., Xn. 
The extension k(X,,..., Xn)/k(s1,...,5n) is Galois, and its Galois group is natu- 
rally identified with the symmetric group Sy. A rational functionr € k(X1,..., Xn) 
is symmetric if and only if it lies in k(s,,...,5n); that is, if and only if it can be 
expressed as a rational function in the elementary symmetric functions $1,..., Sn. 
The polynomial f (X) is irreducible over k(s1,..., Sn). 


Proof. Only the last statement has not yet been proved. S, is the Galois group of 
f over k(s1,...,8,). Since S, acts transitively on the roots X1,..., X, of f, the 
irreducibility of f is guaranteed by F7 in Chapter 8. 


We will now look at things from a different angle. As before, let k be an 
arbitrary ground field. Let K = k(u1,...,Un) be the field of rational functions in 
the n variables u1,U2,...,Un over k. In the polynomial ring K[X] over K, consider 
the polynomial 


(20) e(X) = X" uy XN" | hag x” 4 1)": 
This is called the general polynomial of degree n over k. 


Theorem 4. The general polynomial of degree n over k is separable, and it is ir- 
reducible over the field K = k(uy,...,Un) of its coefficients. Its Galois group over 
K (also known as the “Galois group of the general equation of degree n over k”) is 
isomorphic to the symmetric group Sn. 
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Proof. Let E be the splitting field of g over K =k(u1,...,Un). Over E we have 
n 

(21) g(X) =] [(x-xi). with x; € E. 
i=1 


Then EF = K(x1,...,Xn) = kK(xq,...,Xy). Is E a field of rational functions 
in n variables over k? To answer the question, consider the polynomial ring 
k[X1,..., Xp] in the n variables X1,..., X, over k, and as before take the polyno- 


mial 
n 


(22) f(X):= []X- Xi). 
Then let ae 
(23) K[X,..., Xp] k[xq,.-. Xn] 


be the unique homomorphism of k-algebras taking each X; to x;. This map obvi- 
ously satisfies 
5) = 8i(M%,..., Xn) $j (41,...,Xn) = Uj. 


Therefore (23) gives rise to a homomorphism of k-algebras 


(24) k[s1,..-,5n] > A[u1,...,Un] such that 5; u;. 

This is an isomorphism: indeed, since k[u1,...,uW,] is the polynomial ring in the 
variables u1,..., UW», there is a homomorphism of k-algebras 

(25) kluy,...,Un|] > k[s1,...,5,] such that uj bh 5;, 


and the maps (25) and (24) are clearly inverse to each other. The isomorphism (24) 
has a unique extension to the fraction field: 


(26) K'’ =k(s,,...,8,) > K =k(uy,..., Un). 


We now claim that (23) too is an isomorphism. In particular, x1,...,X, are 
pairwise distinct and hence g is separable. The surjectivity of (23) is clear; we must 
prove its injectivity. Let h(X1,..., Xn) € k[M41,..., Xn] satisfy h(x1,...,Xn) = 0, 
and consider 

Nh) = [| oh =h-[[ oh ee k[X,..., Xn. 
o€Sn oFl 
Clearly N(/) lies in the fixed field of S,, so N(h) €k(s1,...,5n). Also N(A) lies in 
the kernel of (23) because / does. But the map (23) coincides with the isomorphism 
(26) on k(s1,...,5n) k[X,..., X,]. Thus N(h) = 0 and hence A = 0. 

We now can extend (23) in unique fashion to an isomorphism k(X1,..., Xn) > 
k(x1,...,Xn) of fraction fields, and taking this together with (26) we get the com- 
mutative diagram 


BOG ecg RS ROA ae) 
(27) 
K = K 
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The isomorphism (26) gives rise to an isomorphism 
K'[X]—> K[X] 


of polynomial rings; this isomorphism maps f to g. Because / is irreducible in 
K’[X] we then deduce that g is irreducible in K[X]. Finally, (27) yields 


GR ets Ss) LR) COG a Xn) (RO) Bn 


(see F3). This completes the proof of Theorem 4. 


The proof also showed that (24) is an isomorphism, and hence that the field 


k(s1,...,8n) of symmetric functions can be seen as the field of rational functions 
in the n variables s;,...,5,. Therefore: 
F4. Letr © k(X1,..., Xn) be symmetric. There exists a unique 


geEk(X,...,Xn) such that r = g(s1,...,5n). 
In fact we have more: 


Fundamental Theorem on Symmetric Functions. Every symmetric polynomial h 
in k[X1,..., Xn] has one and only expression in the form 


h= g(s1,..-,5n) 
for g apolynomial in k[X,,..., Xn]. 


This theorem can be proved directly with some effort; see, for example, van der 
Waerden, Algebra I. But it can also be shown to follow from F4 if one has the right 
conceptual tools; we do this in Chapter 16.” 


F5. Given any finite group G, there exists Galois extensions E/F such that 
G(E/F)~G. 


Proof. Setn = |G|. Then G is isomorphic to a subgroup U of Sy, (Chapter 10, F2). 
By F3 (or Theorem 4) there exists a Galois extension E/K such that G(E/K) = Sn. 
Let F be the fixed field of U in E. By Galois theory, G(E/F) =U = G. 


E 
ie 


8, 


K 


2 For a delightful little exercise, show that, conversely, F4 follows from the theorem just 
stated; this is simple once you see how. On the other hand, F4 implies: /fr is symmetric 
andr = g/h for g,heEk[X,,..., Xn] relatively prime, then g and h are symmetric. To see 
this, show first that if two elements of k[s1,..., 57] are relatively prime in k[s1,..., Sy], 
they are also relatively prime in k[Xj,..., Xn]. 
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Remarks. With F5 we have been able to give a surprisingly simple answer to our 
Question B (page 173). Another, much harder question is this: Once a ground 
field K is fixed, for what finite groups G is there an extension E such that E/K 
is Galois with G(E/K) ~ G? In particular, taking the case K = Q, one would 
like to know whether every finite group G occurs as the Galois group of a Galois 
extension E'/@. This is a central problem of inverse Galois theory. It can be shown 
using more or less elementary methods that for every natural number 7 there exists 
f € Q[X] such that the Galois group of f over @ is isomorphic to S,, (see van der 
Waerden, Algebra I). A much deeper result of Scholz and Shafarevitch says that 
every solvable group G occurs as the Galois group of a Galois extension £/Q. 


As to our Question A on whether there exist nonsolvable finite groups, the 
following statement will be proved in the next section: 


F6. The symmetric group Sy, is not solvable for n = 5, and it is solvable for n < 4. 
As a corollary we obtain: 


Theorem 5 (Abel, Ruffini). The general polynomial of degree n over k is not solv- 
able by radicals ifn = 5. 


Proof. Let g be as in (20) the general polynomial of degree n over k and let 
K =k(u,...,Un) be its coefficient field. By Theorem 4, the Galois group of g 
over K is isomorphic to the symmetric group S,. If the polynomial g € K[X] were 
solvable by radicals, therefore, S;, would have to be a solvable group, by Theorem 3; 
and F6 says this is not the case for n > 5. 


F7. Let K be a field with char K 4 2,3. Every f € K[X] of degree at most 4 is 
solvable by irreducible radicals. 


Proof. Let G be the group of f over K. Then G is isomorphic to a subgroup of 
Sn, where n is the degree of /. In view of the second sentence of F6, together with 
part (a) of Fl, G is solvable. The assertion then follows from Theorem 2’. 


4. We now must prove F6; and we take the opportunity to talk a bit about permuta- 
tions. In this section M will denote a set with n elements, say M = {1,2,...,n}, 
and S = S(M) ~ Sy will denote the group of all permutations of M@. The group 
S acts on M via 

(0,a)> oa =o(a). 


For a given o € S we consider in particular the action of the cyclic group H = (oc) 
on M. The orbit of a € M under H is also called the orbit of a under o. Let 
Hg ={p€ H| pa=a} be the stabilizer of a under H, and d = H: H, its index in H. 
Theno'a=ola => 0 Ja=a & o' J eH, = dli-j — i= modd, 
so the orbit 


2 


Ha= {d,oa,o°a,... ot ah has d elements. 


Now set a; := 0'~'a, so that 


0a; = a2, 0a2 = 43, easeos Oadg =a. 
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Definition 4. An element p € S(M) is called a cycle of length d if there exist d 
distinct elements a@;,@2,...,@q in M such that 


paj =aj4, fori<d; pag=a,; pa=a forall ae M ~ {a,...,aq}. 


Remarks. (a) A cycle of length | is the identity: p = 1. 


(b) Given d distinct elements a,,d2,...,aqg € M there is obviously a unique 
p € S(M) satisfying the conditions in Definition 4. This cycle p of length d 
is denoted by 

p=(a\d2...aq). 


Note that (a;a2...aq) = (a203...aqgd,) =-+-+ = (aga ...ag_,). Moreover, 
(a) = 1 for anyae M. 


(c) A cycle of length d obviously has order d. 


(d) Cycles transform elegantly under inner automorphisms: for any t € S(M), 


(28) T(a\d2.. .aqg)t! = (Td, Ta2...Taq). 


Given o € S(M), define the set W(o) = {a € M | oa £4}; this is the union of 
orbits of o (i.e., orbits under ()) that have length greater than 1. Two permutations 
o,t € S(M) are called disjoint if W(o) N W(t) = @. Obviously, in this case the 
two permutations commute: ot = To. 

Again let 0 € S(M) be given and let C;,...,C; be the distinct orbits of o. As 
we saw above, there exists for each Cj a cycle p; such that pja =o for all a € Cj 
and W(p;) © C;. By definition, p;, 92,..., P- are pairwise disjoint. We claim that 


O=P1p2..--/pr- 
Indeed, any given a € M lies in precisely one C;, and then 
P1p2..- Pra = pja=oa. 
We thus get the first statement in the following result: 
F8. Every o € S(M) can be represented as a product 
(29) O = (12... Pr 
of pairwise disjoint cycles with 
(30) ye length p; =n. 
i 


This representation is unique apart from the order of the factors. In (29) one can of 
course omit cycles of length 1, but then (30) is no longer satisfied. 
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Proof. What remains to be proved is the uniqueness. Take o 4 | and let o = 
Vi¥2---Ys be another decomposition into pairwise disjoint cycles y1,..., ys of 
length greater than 1. Take a € W(y,). There is precisely one j such that a ¢ W(p;); 
we may as well assume it is 7 = 1. Then 


yid =O0a= pia, Vie = OY 15 P10 = Of. 


It follows that 1 Ga) = yka =o*qg= pka = pi(pkta). Thus the orbits 
W(”1) and W(p,) of a under y; and ; coincide, and because of this we get 
YX =ox = p,x for all x in W(y,) = W(p;). Outside of this set both permutations 
act trivially. Altogether we obtain y; = 1, and by cancellation pz... p, = y2... Ys. 
By induction we are done. 


F9. (i) Leto € S(M) have the decomposition o = pi p2... Pr into pairwise 
disjoint cycles. The order of o is the least common multiple of the lengths of 
the pi- 

(ii) Two elements o,0' of S(M) are conjugate in S(M) if and only if their decom- 
positions into cycles have the same type (the type is defined in the proof). 


Proof. (i) Since all the 9; commute, we have 
(31) o” = pi'p'...p; foreach m. 


In particular, if m = v is the lcm of the lengths (which is to say, the orders) of 
the cycles ;, equation (31) implies that ord(a)|v. From (31) we get the cycle 
decomposition of 0”. Thus from o” = 1 we get p/” = 1 for all i. Therefore 
ord(p;) divides ord(o) for all i, and hence so does v. 


(ii) A decomposition 


O = Pip2---Pr 
into pairwise disjoint cycles is called normalized if it satisfies condition (30) in F8. 
We say that o € Sy has type c1,C2,...,Cn if the normalized decomposition of o 


contains precisely c; cycles of length 7. We have n = )° ; J¢j- For an arbitrary 
tT € Sn, equality (28) says that 


tot! =(tpyt')(tp2t!)... (tpt ') 


is the normalized cycle decomposition of tat~!; this of course has the same type 
as the normalized decomposition of o. 
Conversely, suppose o, 0’ € S(M) have the same type and let 
C= PivecPrs. 8 SPysee Ber 
be their normalized decompositions. Then r = r’, and after renumbering we can 
assume that 
length o; =lengthp; for 1 <i <r. 
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For fixed 7 set 
pi =(@1...4g), py = (A... 04), 


and let y; be the bijection from M; = {a1,...,aq} onto Mj = {a',,...,a’,} defined 
by gi (a;) = di. Because the 0;,..., 0, are disjoint we can form a map t: M > M 
coinciding with g; on each Mj, and this is a bijection because the p/,,..., o) are 


also disjoint. Again from (28) we get 


ror! = (cot ')...(to,pt |) =p}...p, =o", 


so o and o’ are conjugate in S(M). 


We denote by 
(32) sen: S, > {+1,-l} 


the well known signature map or parity map, which assigns to each permutation 
o € S, its sign or parity sgn(o) (see for instance LA I, p. 160). A permutation o is 
called even if sgn(o) = 1; otherwise it is called odd. If 


(33) O =T1T2...Ts 


is a representation of o as a product of transpositions (cycles of length two), we 
have 


sgn(o) = (—1)’. 
Note that the t1,..., 7; in (33) are generally not pairwise disjoint, nor is a repre- 
sentation in the form (33) unique; only the parity of s is determined by o. 
For cycles o of length d, one clearly has 
O = (1d2...dg) = (4142) (4243)... (4g—14a), 
so 
(34) sgn(a) = (—1)47". 
Since the map (32) is a homomorphism, the set 
An:= {o € S, | sgn(o) = 1} = ker sgn 


is a normal subgroup of S,, of index 2 if n = 2. It is called the alternating group 
of degree n, and it consists of all even permutations in S,. By (34), cycles of odd 
length are even and cycles of even length are odd. 

We now can easily show: Sy is solvable forn <4. For S; = 1 and Sp ~ Z/2Z 
this is clear; for S3; we have the chain 


(35) S32 Az;b 1 with $3/A3 ~ /22, Az xX /3Z. 
In S4, consider the set 


(36) Va = {1, (12)(34), (13)(24), (14)(23)}, 
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consisting of the identity and of all the double transpositions in S4. Since 
(12) (34) (13) (24) = (14) (23) = (13) (24) - (12)34), 


this is a subgroup of $4, isomorphic to Z/2Z x Z/2Z. It is called the Klein four- 
group. In view of (28), it is a normal subgroup of S4. Clearly V4 is contained in 
Ag, and by considering the orders we see that A4/V4 ~ Z/3Z. Now the chain 


(37) Sab AgeVsEb 1 


shows that S44 is solvable. Moreover one easily sees that V4 is the only nontrivial 
normal subgroup of Aq, and that S4/V4 ~ S3. 


Lemma. Let n= 5 and let G be a subgroup of Sn containing all three-cycles (that is, 
cycles of length three). If N is anormal subgroup of G with abelian quotient G/N , 
then N contains all three-cycles. 


Proof. Let (abc) be a three-cycle. Since n > 5, there exist d,e such that d,e € 
M \~ {a,b,c} and e # d. Set o = (ace) and p = (abd). By (28) we have 
pop ' = (pa pc pe) = (bce), so 

pop ‘a! = (bce)(eca) = (abc). 


But N contains all commutators of elements of G since G/N was assumed abelian; 
in particular N contains (abc). 


With this lemma we can now easily show that S, is not solvable for n = 5. For 
otherwise there would be a chain S, = Go &> G, > G)>---> Gy», = 1 with all 
factors Gj-1/G; abelian. By induction, it would follow from the lemma that every 
G; contains all three-cycles. But this is impossible since Gm = 1. 


The unsolvability of S, for n => 5 has had interesting consequences for us from 
the field-theoretic point of view; but in fact a much more encompassing result is true: 


Theorem 6. The alternating group Ap is simple forn = 5. 


Recall that a group G # 1 is called simple if it has no normal subgroup apart 
from itself and 1. A proof of Theorem 6 is outlined in $15.13 and §15.16 in the 
Appendix. 


5. Returning to our earlier line of investigation, we now wish to find an explicit 
solution by radicals for cubic polynomials f over a field K of characteristic distinct 
from 2 and 3. As remarked on page 165, we may as well assume that / has the 
form 


(38) f(X) =X? + pX +4; 


we also assume that / is irreducible in K[X]. Let E be a splitting field of f over 
K and let a@1,@2,@3 be the roots of f in E. We now proceed according to the 
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theory. First we adjoin a primitive third root of unity ¢, and so form the diagram 


E':= E(G) 
| K':= K(¢) 
K ae 


Now let F’ := K’(/D), where D denotes the discriminant of f (Chapter 8, 
Definition 4). Since VD is not preserved by any transposition of roots, we have 
E': K'(/D) <3. At the same time, since / is irreducible, E: K is divisible by 3, 
and hence so is E’: K’, since K’: K < 2. Putting it together we get 


E': F' =3. 


Thus the extension E’/F’ is cyclic of degree 3. By the material in Chapter 14 — 
see in particular equation (6) there—we have E’ = F’(v), where v, a Lagrange 
resolvent, is defined by 


v=B+6?o(B) + 607(), 


where f € E’ is to be chosen so that v 4 0 but is otherwise arbitrary. We first try 
B =a, so that 


(39) v=ayt lan +la3, 

and we consider at the same time the resolvent 

(40) u=a, + lar +Ca3. 

We have u + v = 2a; + (+ €7)oo + (§ + €7)a3 = 3a since 1+6+4+¢7 = 0 and 
a +@2 +3 = 0; and observing that multiplication by ¢ or ¢7 in (39) and (40) 


induces a cyclic permutation on the indices, we likewise get the last two equations 
in the following trio: 


(41) utv=30,, Cut+lv=3a,, lCut+t?v = 303. 


By construction, uw? and v? lie in F’; one sees on conjugacy grounds that these 
are the roots of a quadratic equation over K’, and we can actually write down this 
equation explicitly by computing the sum and product of u3 and v?. First we have 


w+ve=(utv)\(utov)(ut+ C7v) = 3a «303-302, 
where the second equality comes from (41). Therefore 


(42) ui+v? =—27q. 
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Next, from (39) and (40) we get 

(43) uv = a} +o3+az + C7 (ayay+ar203 +0103) + C(aya2+4203+01 03) 
=ajtoast+az + (o?+6)p =at+a3+oa}—p 
= (a1 +a2+03)? —3p = —3p, 


so 
(44) uey? = —27p?. 

Thus u? and v? are the roots of the quadratic polynomial 
(45) XO gx OT pe, 


whose coefficients actually lie in K. There follows 


Gages Gyre): 
Together with (41) we then get: 


F10. Let K have characteristic distinct from 2 and 3. The roots of the polynomial 
f(X) = X37 4+ pX +q € K[X]are given by 


qd a ae (2) +(2)° 
(46) \-4 + (2) +(2) + y- 5 


where one cube root y can be chosen at will, but the other cube root y'’ must be 
chosen so that 3yy' = —p. (The choice of a square root is arbitrary but must be the 
same in both terms.) The discriminant D of f (X) is 


(47) D =—4p? — 274’. 


Proof. Regardless of whether f is irreducible, we have seen above that, if w and v 
are defined by (39) and (40), they satisfy the relations (41), and u>,v> are the roots 
of the quadratic equation (45). It follows that every root of # must have the form 
(46). To be safe we must show that for any f (even if inseparable), the numbers 
given by (46) are indeed zeroes of f. The cube roots y = u/3 and y’ = v/3 
that occur in (46) satisfy y? + y’? = —q and 27y?y’3 = —p? (since u?, v? are 
solutions of the quadratic equation X* + 27qX — 27p? = 0), and by assumption 
they are normalized so that 3yy’ = —p. It follows that (v+y’)?+p(y+y’)+q = 


yity?t3y?y' +3yy? t+ piyty) +49 =-94+3vy (vty) + viyty’) +9 =0. 
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Finally we have to verify the formula (47) for the discriminant. Setting A = 
(0-3) (1-13) (2-03), we have 
27A = (3a1—3a2) (301-33) (302-33) 
= (utu—C7u—6v)(u+u—Cu—C7 v) (C7 ut+-6v—lu—C" v) 
= ((1-$)?u + (1—g)v) ((I—S)u + $7) v) (6-8) + (6-67) 
= (1-6)? ((1+5)u + v)(u + (1+6)v)((u-v)(@?—-9)) 
= (6-1)°S(—Cu+v)(u—6"v) (uv) 
= (1-$)? (u—Gv) (u—7 v)(u—-v) = (1-$)? (W-v’); 
setting D = A? and using the fact that (¢— 1)? = ¢7 -2¢+ 1 = —3¢ we get 
2P D = (€—1)§ (v3 — v3)? = —27(43 — v3)’, 
so —27D coincides with the discriminant of the quadratic polynomial (45): 


~27D = 274? +4-27p’. 


This implies (47). 


Note that (47) can be rewritten as 


(48) DAA 7 ((4) ES (4)’). 


so that the square root appearing in the Cardano formula (46) is related to the 
discriminant by 


(49) V25p 9-5 (2) +(2). 
Remarks. (1) From the introductory remarks of this section we see that, if the 
cubic polynomial f is irreducible, the extension E/K(\/D) is cyclic of degree 3. 
Moreover, since f has degree 3, the irreducibility of f in K[X] is equivalent to 
there being no roots of f in K. In the sequel we will identify the Galois group G 
of f over K with a subgroup of S3, by fixing the numbering of the roots a1, a2, 03 
of f. 


If f is irreducible we therefore have: 


(50) é= he if D is a square in G, 


S3 if D is not a square in G. 


An example of the first alternative with K = Q — that is, a cubic polynomial over 
Q whose Galois group G over Q is cyclic of order 3—is given by 


(51) X* S341, 


For this is clearly irreducible (Chapter 5, F8), and it has discriminant D = 81 by 
(47). Examples of the second alternative, illustrating the (generic) case G = $3, are 
given by X3-X+4+1, ¥3—2, and X3—4¥-+1, with discriminants —23, —108, 229. 
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(2) Assume that K is a subfield of R. Then, for any cubic polynomial f over K, 
(52) D>0 <> all the roots of f lie in R. 


To see this, note first that f, being a polynomial of odd degree, has at least one real 
root. Either all roots @;,@2,a@3 of f are real or, say, a; is real and a3 = @ ¢R. 
In the former case A = (a; —a2)(@1—3)('2—@3) is real, so D = A? > 0; in the 
latter case, A = (1 —@)(@1—@2)(@2—@) is purely imaginary, so D = A? <0. 
Suppose in addition that D 4 0, so a1, @2,@3 are all distinct. By (48) we have 


D>0 = v(2) +(4) ee. 


Thus, for f separable, equation (52) says that the Cardano formula (46) yields three 
real roots if and only if the square root appearing in the formula is nonreal (“casus 
irreducibilis’’). 


6. In this section we will deal with solvable equations of prime degree. We will 
need to have at hand the following group-theoretic result. 


Lemma. Let p be a prime number and let G be a subgroup of the full permutation 
group of a set M with p elements. Assume that G acts transitively on M. 


(a) Any normal subgroup N # | of G also acts transitively on M. 


(b) If G is solvable, it contains a unique subgroup H of order p (which is then 
necessarily normal in G). 


Proof. (a) Take a € M. For any o in G we have 
oNa= Noa. 


Since G acts transitively on M, all orbits of N have the same length, say m. Thus 
m divides p =|M| (see Fl’ in Chapter 10). Since N 4 1 we have m > 1. It follows 
that m = p, that is, N acts transitively on M. 


(b) Since G acts transitively on the p-element set M, the order of G is divisible by 
p (Chapter 10, F4). Assume |G| > p (otherwise there is nothing to prove). Because 
G is solvable, it contains a normal subgroup N distinct from G and from 1. By 
part (a), N acts transitively on MM. By induction on the order of the group, we 
can therefore assume that N has exactly one subgroup H of order p. Then H is a 
characteristic subgroup of the group N; that is, for each automorphism a of N we 
have oH = H. In particular, H is a normal subgroup of G (because every inner 
automorphism of G gives rise to an automorphism of NV, since N is normal in G). 
So G contains a normal subgroup H of order p. Let H’ be any subgroup of order p 
and assume that H’ is distinct from H. Then HM H’ = 1, and the quotient H’H/H 
satisfies 
H'H/H ~ H’/H'OH ~ H’, 


so the subgroup H'H of G has order p” (see F2). It follows that p* divides |G], 
and hence p!, contradicting the primality of p. 
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Now let f € K[X] be an irreducible polynomial of prime degree p. We assume 
that f is separable (that is, if p = char K, the polynomial does not have the form 
c(X? —a); see Chapter 7, F12). The Galois group G of f over K acts transitively 
on the p-element set M of roots of f in a splitting field E of f over K (Chapter 
8, F7). 

Assume that the Galois group G of f over K is solvable. By part (b) of the 
preceding lemma it follows, first, that G contains an element p of order p. Regarded 
as a permutation of /, the element p is then necessarily a cycle of length p, that 
is, for some numbering 1, 02,..., 0p of the roots of f in E, we have 


(53) pa; =ai41 fori <p, Pap = ay. 


We now identify M = {a,a@2,...,a@p} with the p-element field F, via the map 
a; +> 7, thus also identifying G with a subgroup of S(E,). Then (53) becomes 


(54) px=x-+1 forall x ek. 


In other words, the action of p on elements of F, is simply translation by 1. But 
more is true: Let o be any element of G. By the lemma, the subgroup generated by 
pis anormal subgroup of G. Therefore 


for some natural number a < p. Thus opx = p*ox for all x € F,; hence, by (54), 
o(x+1)=ox+a. 
Setting b := 0 (0), we get o(1) = b+a, o(2) = b+a+a, ..., and in general 
ox=ax+b forall x ef. 


Definition 5. A permutation o € S(E,) is called affine if there exist elements a € Fy 
and b € F, such that 


(55) ox =ax +6 forall x € fF. 


A subgroup G of S(F,) is called an affine subgroup of S(F,) if every element of G 
is affine. 


Theorem 7 (Galois). (I) Let f € K[X] be irreducible of prime degree p; also 
assume that f is separable (that is, not of the form c(X? —a) if p = char K). 
As before we regard the Galois group G of f over K as a subgroup of S(Fp). 
Then, if G is solvable, it is an affine subgroup of S(Fp). 
(Il) Every affine subgroup of S(Fp) is solvable. 


Proof. Part (I) has been proved above. For part (II), let’s denote an element o € S(F,) 
of the form (55) by 04,5; note that b € F, anda e€ rp are uniquely determined by 
o. A simple calculation shows that 


Oa,b ° Oa’, b’ = OCaa',b+ab’- 
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Let G denote the set of all affine elements of S(F,). Then G is a subgroup of S(F,), 
and the map o = 0,4 +> a is a homomorphism form G onto the multiplicative group 
Ce of F,. Let N be its kernel. Obviously, N consists of all elements of the form 
01,5, that is, of all translations. The map o;,, +> 6 is then an isomorphism of N 
onto the additive group Z/pZ of F,. Thus N is cyclic (of order p). The quotient 
G/N is isomorphic to the multiplicative group FX, by the foregoing. Therefore 
G/N is also cyclic, by Chapter 9, Theorem 2. Putting it all together we conclude 
that G is solvable (see Fl). Since a subgroup of a solvable group is solvable, we 
are done. 


The lovely result just proved is Proposition VII of Galois’s “Mémoire sur les 
conditions de résolubilité des équations par radicaux”, and can be found in Ecrits 
et mémoires mathématiques d’Evariste Galois, Gauthier- Villars, Paris, 1962. 


Remark. The group S(F,) itself is not affine for p = 5, since it has precisely 
P(p—1) elements of the form (55). By Theorem 7, therefore, S, is not solvable 
(since up to isomorphism, S, occurs as the Galois group of the general equation of 
degree p: see Theorem 4). In particular, Ss is not solvable. And since for n = 5 
the symmetric group S, clearly has a subgroup isomorphic to Ss, we have proved 
again that S,, cannot be solvable for n > 5. 


As an application of Theorem 7 we obtain another result of Galois: 


F11. Let f € K[X] be irreducible of prime degree p, and assume f is separable. 
Let E be a splitting field of f over K and G the Galois group of E/K. 
If G is solvable, E arises by adjunction of any two distinct roots of f . 
Conversely, if there exist two roots a, B of f such that E = K(a, B), the group 
G is solvable. 


Proof. Let G be solvable and let a and 6 # a be roots of f in E. We must show that 
E = K(qa, B), or equivalently, by Galois theory, that G(E/K(qa, B)) = 1. Assume 
for a contradiction that G(E/K(a, B)) contains an element o 4 1. Then @ and 6 are 
both left fixed by o. But a map of the form (55) distinct from the identity obviously 
has either exactly one fixed point or none. 

For the converse, suppose that E = K(a, B), where a, B are roots of f. Hence 


(56) IG|= E:K < p(p-1), 


because K(a): K = p and K(a, 6): K(a) < p—1 no matter what. Also since 
K(a): K = p, there is an element p of order p in G. If H = (p) is normal in G, 
it follows, as we saw above, that G is isomorphic to an affine subgroup of S(F,). 
Thus G is solvable, by Theorem 7. On the other hand, if H is not normal in G, we 
can find o € G such that H’ := 0! Ho is distinct from H. Because p is prime, 
H (1 H' =1. Therefore HH’ (though it need not be a subgroup of G) contains p? 
elements, in contradiction with (56). 


The following consequence of F11 was first stated by Kronecker: 
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F12. Let K be a subfield of the field R of real numbers, and let f € K[X] be an 
irreducible polynomial of prime degree p > 2. If the Galois group of f over K is 
solvable, f has either exactly one root in R or all its roots inR. 


Proof. Being a polynomial of odd degree, f certainly has at least one root a in R. 
Assume there is another one, 6B € R, with 6 4 a. Using F11 we conclude that the 
subfield K(a, 6) of R must be a splitting field of f over K. But then all the roots 
of f lieinR. 


Remarks. Using F12 it is not hard to find polynomials of prime degree with rational 
coefficients that are not solvable by radicals over R. For example: 


(i) For every prime p = 5, the polynomial 
(57) f(X) =X? -4X 42 
is not solvable by radicals over Q. 


Proof. First, by Eisenstein’s criterion, f is irreducible over Q. If f were solvable 
by radicals over Q, it would have, by F12, either exactly one real root or exactly 
p real roots. But a simple analytic argument shows that in fact f has exactly three 
real roots: If k is the number of real roots of /, the derivative f’(X) = pX?~!—4 
has at least k—1 roots in R, by Rolle’s Theorem. But clearly pX¥ ?~! —4 has exactly 
two real roots, so k < 3. On the other hand, the Intermediate Value Theorem gives 
k = 3, since f(—2) <0, f(0) > 0, f(1) <0, f(2) > 0. 


For p = 5 more can be proved: 


(ii) The Galois group of X* —4X +2 over Q is isomorphic to the full symmetric 
group Ss. 


Proof. As already seen, the polynomial (57) in the case p = 5 has precisely two 
nonreal roots, which we can view as elements of C, by the Fundamental Theorem of 
Algebra (see Remark to Definition 2 in Chapter 6). The permutation of the roots of 
f determined by complex conjugation, z+ Z, is therefore a transposition. Hence, if 
we regard the Galois group G of f over @ as a subgroup of Ss, this group contains 
a transposition. Since G also contains a cycle of length 5 (any element of order 5), 
it must coincide with S5, as follows form the following general fact, whose proof 
is left as an exercise. 


(iii) [fa subgroup G of Sy, contains both a transposition and a cycle of length n, and 
nis a prime number, then G = Sy. (The primality of n is an unavoidable assumption; 
for instance, in S4 the elements (1234) and (24) generate a subgroup of order 8.) 


The converse of the statement of F12 is not true; just consider the polynomial 
(58) f(X) = X°-X-1, 


which has a single real root (again by calculus), but whose Galois group is iso- 
morphic to S5. Justifying this last assertion is not so simple, but it’s easy to see 
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at least that f is irreducible over Q: it suffices to show that f is irreducible as a 
polynomial over some field F, (see F9 and F6 in Chapter 5), which is the case for 
instance when p = 5 (Theorem 3 in Chapter 14). The irreducibility of f implies, 
if nothing else, that the Galois group G has an element of order 5, that is, a cycle 
of length 5. To prove the equality G = Ss, one can for example — in view of (iii) 
above — check that G contains a transposition. One way to do this is to look at f 
over Fy; here we have the prime factorization 


X5-— NX —1 = (X74-X41)(X74-X741) in FLX]. 


Hence the Galois group G of X*— X —1 over Fp certainly contains a transposition. 
Now by a general principle due to Dedekind, this allows one to conclude that G, 
too, contains a transposition; see F13 in Chapter 16. (Another way of proving that 
G = Ss is outlined in §15.25 in the Appendix.) 


7. To conclude this chapter we will cite a beautiful —and fairly deep — theorem 
of David Hilbert (1862-1943), without undertaking to prove it here. 


Hilbert Irreducibility Theorem. Let k stand for the field Q or for any finitely 
generated extension of Q. Then k has the following property: 

If f =X™ + am-rilth,...,t)X"! +--+ + ao(tt,...,tn), with m,n = 1, is 
an irreducible polynomial in the ring k{t,,...,t,, X| of polynomials in the n+1 
variables t,,...,tn, X over k, there exist infinitely many (c,,...,Cn) € k” such that 
the polynomial f(c1,...,¢n, X) € k[X] is also irreducible. 


Surely this theorem already speaks for itself; but its relevance to Galois theory 
will be put in sharp focus by a later result (Chapter 16, F14). 


Remarks. (a) A field having the property stated for k in the conclusion of the 
theorem is called Hilbertian. It turns out that a field is already Hilbertian if 
the property holds with n = 1. 


(b 


wm 


Let k be Hilbertian. For 7 > 1 a stronger version of the property holds, namely: 
If in addition to the irreducible polynomial f € k[t,...,tn, X] we are given 
a nonzero polynomial g in k[f1,..., tn], there exist infinitely many c € k” 
such that f(c, X) is irreducible in k[X] and such that g(c) 4 0. See Lang, 
Diophantine geometry, Chapter 8. This further implies the following fact, by 
F12 in Chapter 7: [fan irreducible polynomial f €k[ti,...,t,, X|is separable 
as a polynomial over k(t, ...,tn), there exist infinitely many c € k" for which 
FI (c, X) is irreducible and separable over k. 
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Integral Ring Extensions 
with Applications to Galois Theory 


1. We now explain how the notion of an algebraic field extension can be generalized 
to rings in the appropriate way. Let A be a ring with unity and R a subring of A 
containing the unity of A. Suppose also that R is central, that is to say, each of its 
elements commutes with all elements of A. (In particular, R is commutative.) In 
this situation we say that 


A/R is a ring extension. 


If A/R is a ring extension, we can regard A in a natural way as an R-algebra. 
Conversely, if A is an algebra (with unity element 1) over a commutative ring R 
with unity, there is a natural ring homomorphism 


R-A 
arPal 


whose image R’ is a subring of the center of A and contains the unity of A. Then 
A/R’ is a ring extension in the sense just defined. 


Definition 1. Let A be an algebra over a commutative ring R with unity. An element 
a of A is called integral over R if there exists a normalized polynomial 


(1) f(X) =X" Hay X" 1 +--+ +49 € RIX] 

of degree n > 1 over R such that 

(2) f(a) =a" +.ay_10" 1 +--+» + aga® = 0. 

An equation of the form (2) is called an integrality equation for a over R. 


Examples. (1) For a field extension E/K, an element of £ is integral over K if 
and only if it is algebraic over K. 


(2) Let k be a field and K = k(X,,...,X,) the field of rational functions in 
n variables X1,..., X, over k. Let s1,...,5, be the elementary symmetric 
functions in X;,..., X,. Then each X; is integral over k[s1,..., 5,]; see (17) 
in Chapter 15. 
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(3) The elements of C that are integral over Z are called algebraic integers. The 
following complex numbers, for example, are algebraic integers: 5, /2, 1 +i, 
e27!/" for n € N, and 3(-1 + 5). 


Fl. Let A be an algebra over a commutative ring R with unity. If A is finitely 
generated as an R-module, every element of A is integral over R. 


Proof. This is a direct generalization of the fact that every finite field extension is 
algebraic (F4 in Chapter 2). Admittedly, the proof is harder. Since we still lack 
certain conceptual tools, we fall back on the following “classical” argument: 

By assumption, A possesses a finite set of generators B1,..., Bn, over R. We 
can assume that 6; = | (by adding this extra generator if needed). We can also 
assume that R is a subring of A. Let a € A be given. For each | < j <n we have 


n 
ap; = S > aj Be for some ax; € R. 
k=1 


Otherwise stated, there are relations 


n 


(3) So (axi —6jxa) By =0 for 1<j <n, 
k=1 


where 6;, is the Kronecker delta. Make the abbreviation 
Cjk = akj — 07k 


and denote by C = (c;x);,x the corresponding n x n matrix over the commutative 
ring Ria]. The adjoint matrix C = (€jx)j,~ of C satisfies 


CC = det(C) Ey, 


where E,, is the n xn identity matrix (see for example LA I, p. 148). From (3) there 
follows, for all 1 <i <n, 


n n non n 
0= aij ( Dauber) =o Davee = Yeu) sib = den(C) fy. 
j=1 k=1 k=1 j=1 k=1 
Since 6; = 1 this implies 
det(C) = det ((a;x —5jx@);,n) = 0. 
Thus @ is a root of the polynomial 


f(X) = det ((5;4X —ajx)) € R[X]. 


But f(X) is normalized of degree n > 1, so @ is indeed integral over R. 
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Definition 2. A ring extension A/R is called integral if every element of A is 
integral over R. 
A ring extension A/R is called finite if A is finitely generated as an R-module. 


Given this definition, Fl can be rephrased very simply: 
FI’. Every finite ring extension A/R is integral. 
The following integrality criterion for elements can then be stated: 
F2. Let A be an algebra over a commutative ring R with unity. Given an element a 
in A, there is equivalence between: 
(i) @ is integral over R. 
(ii) The subalgebra R[a] of A is finitely generated as an R-module. 
(iii) There exists a subalgebra A’ of A that contains a and is finitely generated as 


an R-module. 


Proof. (i) => (ii): By definition, R[a] = {g(a) | g € R[X]}. Let f(a) = 0 be an 
integrality equation for a over R. Since f is normalized, division with rest yields 
for any g € R[X] a representation 


8(X) = A(X) f(X) + r(X), 


with h(X), r(X) € R[X] and degr < deg f =:n. Since g(a) = r(q@) it follows that 
l,a,...,a@”~! generate the R-module R[a]. 
The implication (ii) = (iii) is trivial, and (iii) > (i) follows from FI. 


In the sequel we will assume all rings to be commutative with unity. 
Lemma 1. /f 4/R and B/A are finite ring extensions, B/R is also finite. 


Proof. The relations A = Re; +---+ Rem and B = Af, +---+ Af, obviously 
imply B= Re; fi +---+ Rem fn. 


F3. For a ring extension A/R, there is equivalence between: 
(i) There exist finitely many elements a1,...,Qm of A integral over R and such 
that A = R[a,,...,Qm]. 
(ii) A/R is finite. 


Proof. (ii) => (i) is clear: We can choose for @1,...,Qm a set of elements that 
generate A as an R-module. Then we actually have A = Ra, +---+ Ram, and Fl’ 
says that the a; are integral over R. 


(i) > (ii) is proved by induction on m. For m = 0 there is nothing to prove. Suppose 
(i) holds with some m > 1 and set A’ = R[a1,...,@m—1]. Then A = A’[a»], and 
Qm is integral over A’. Therefore, by F2, A/A’ is finite. A’/R is also finite, by the 
induction hypothesis. The finiteness of A/R follows from Lemma 1. 
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F4 and Definition 3. Let A/R be a ring extension. The subset 
C ={a€A|a is integral over R} 
is a subring of A containing R. We call C the integral closure of R in A. 


Proof. Clearly R € C, since every a € R is a root of the normalized polynomial 
X —a over F. Now let a, B be elements of C, and take the subalgebra R[a, 6] 
of the R-algebra A. By F3, the extension R[a, 6]/R is finite. Thus, by FI’, all 
elements of R[a, 6] are integral over R, so 


Ria, BJ] SC. 


In particular, w + 6, w — B and af all belong to C. This completes the proof. 


Definition 4. Let 4/R be a ring extension. We say that R is integrally closed in A 
if R coincides with its integral closure in A. 


F5. For every ring extension A/R, the integral closure C of R in A is integrally 
closed in A. 


Proof. Let a € A be integral over C. Then 
oo +0,-10" | +---+a9=0, with a eC, n>. 


Clearly @ is then integral over the R-subalgebra A’ = R[ao,a1,...,@n—1] of C as 
well. By F3 the extension A’/R is finite; A’[w]/A’ is also finite, by F2. Therefore 
A'[a]/R is finite. But then, by F1’, the element @ must be integral over R (since it 
belongs to A’[a@]). This shows that a € C. 


F6. Let A/R and B/A be ring extensions. If A/R and B/A are integral, so is B/R 
(and conversely). 


Proof. Let C be the integral closure of R in B. Since A/R was assumed integral, 
we have A C C. Since B/A is also integral by assumption, B/C is integral. But 
now F5 says that C is integrally closed in B, so B=C. 


2. We now turn our attention to subrings of fields. 


F7. Let E/R be a ring extension, and assume that E is a field. If a € E is algebraic 
(equivalently, integral) over the field K of fractions of R in E, there exists a nonzero 
c € R such that ca is integral over R. 


Proof. Let 
(4) f (@) =a" + aya"! +++» +9 =0 


be an algebraic equation for a over K. There certainly exists a nonzero c € R such 
that 
caj€é R forall O<i<n-l. 
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Multiplying (4) by c” we get 


(car)” + can_1(ca)” |! +---+ aoc” = 0, 


which is an integrality equation for ca over R. 


Definition 5. An integral domain R is called integrally closed, or normal, if R is 
integrally closed in its fraction field. 


F8. Every unique factorization domain R is integrally closed. 


Proof. This has already been stated in different words in Chapter 5, F8 — and proved 
using Gauss’s Lemma! One can also justify the statement as follows: Any element 
a of K = Frac R has the form a = a/b, with a,b € R; since R is a UFD, one can 
also assume that a, b are relatively prime. Now let @ be integral over R, satisfying, 
say, a” + dna"! +---+ a9 =0 with each a; in R. Multiplication by b” yields 


a" + bay_ja™ | +--+ aob" =0, 


so b divides a”. But because a, b are relatively prime and R is a UFD, this can only 
happen if 5 is a unit of R. Then a = a/b lies in R. 


Remark. We can now show how the Fundamental Theorem on Symmetric Func- 
tions (page 177) can be derived from F4 in Chapter 15: Let A(X1,...,X,) be a 
symmetric polynomial in k[X1,..., Xn]. We already know that / lies in the subfield 


k(s1,...,8n) of k(X1,..., Xn) generated by the elementary symmetric functions 
S1,...,5,. This subfield is the fraction field of the ring R:= k[s1,..., 5]. We also 
know that each YX; is integral over k[s1,..., 5]; thus, by F4, so is each element of 


k[X1,..., Xp]. In particular, 4 is integral over R. But R = k[s1,..., Sy] is a poly- 
nomial ring in n variables over a field k; therefore R is a UFD, by Gauss’s Theorem 
(page 46), and hence integrally closed, by F8. Thus / does lie in R = k[s1,..., Sn] 
as desired. 


Lemma 2. Let A/R be a ring extension and o : A > B a ring homomorphism. If 
a € A is integral over R, then o(a) is integral over o(R). 


Proof. This is clear. 


F9. Let E/K be a finite field extension and assume K is the fraction field of an 
integral domain R. If R is integrally closed, the minimal polynomial of an element 
a of E integral over R has all its coefficients in R. In particular, Sz; (a) and 
NeE/K (a) lie in R. 


Proof. Over an algebraic closure C of E, let g := MiPox (a) have the factorization 


n 
g(xX)= [ [¥-«). with a; =a. 

i=1 
There exist K-homomorphisms o; : K(a) — C such that o;(@) = a;. By Lemma 2, 
all the a; are integral over R. The coefficients of g are polynomial expressions 
Sj(Q1,...,@,) in the a;, and thus, by F4, also integral over R. But R was assumed 
to be integrally closed, and g lies in K[X], so we obtain g € R[X] as required. 
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Another justification for F9 is provided by the next result, which in view of F8 
represents a generalization of Gauss’s Lemma (F7 in Chapter 5): 


F10. Let R be an integrally closed integral domain, with fraction field K. Let 
ft, g,h € K[X] be normalized polynomials over K, with 


(5) f= gh. 
If all the coefficients of f lie in R, so do the coefficients of g and h. 
Proof. Let E be a splitting field of f over K. Over E we have 


f(X) = T[X~ ax). 
k=1 


Since f is normalized, all the a, are integral over R. In view of (5), there exist 
I,J C{l,2,...,} such that 


g(X) =] [(x-aj), A(X) =] [(X-9)). 
ie] jes 


Being polynomial expressions in the integral elements a; the coefficients of g and 
h are also integral over R. Moreover they lie in K, and R is integrally closed; thus 
indeed g,he€ R[X]. 


F11. Let E/R be an integral ring extension. If E is a field, so is R. 


Proof. Take a € R~ {0}. The element 1/a of E satisfies by assumption an equation 
of the form 
(1/a)" + dn—1(1/a)""! +--+ +.a9 =0 


over R. Multiply out by a”! to get 


1/a = —an_1 —An_2a—---—aga”” 1 € R. 


Thus R is a field. 


The next result is useful in various contexts: 


F12. Let A be a subring of C obtained from Z by adjoining algebraic integers. 
For a given prime p the natural homomorphism Z — Fy can be extended to a ring 
homomorphism from A into an algebraic closure of Fy. 


Proof. First note that the principal ideal pA of A is distinct from A. Otherwise there 

would be a relation 1 = pa with a € A, and then we would have 1/p€ ANQ=Z. 
Next, since pA # A, there is a maximal ideal $B of A such that pA C $B (see 

Chapter 6, F12). The inclusion Z C A then yields a natural homomorphism 


(6) Z/pZ— A/. 
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This map is injective, since Z/pZ is a field. Because ‘8 is maximal, A/B is also 
a field. We can then view A := A/S as an extension of FE, = Z/pZ, via the map 
(6). Since A is integral over Z, the field Ais algebraic over F, (see Lemma 2), and 
so is contained in an algebraic closure of F,. Thus the natural map A > A/S = A 
extends the map Z > Z/ pZ =F, and yields a homomorphism of A into an algebraic 
closure of F,. 


Remark. In addition one sees easily that if A/Z is finite, say with A=Z[a1,..., an], 
there can only be finitely many extensions with the properties stated in the conclu- 
sion of F12. 


We would not want to pass up the chance to point out that F12 is a special case 
of a result that is quite general: 


F12*. Let A/R be anintegral ring extension (of arbitrary commutative rings). Every 
homomorphism from R into an algebraically closed field F can be extended to a 
homomorphism from A into F.. In other words: For each prime ideal of R there is 
at least one prime ideal $B of A such that 


(7) p=PNR. 
We will not prove F12* here; see §16.12 in the Appendix. 


3. Now we would like to show how the basic results about ring extensions presented 
in the last two sections are useful, for instance, when one is investigating the Galois 
group of a given equation. Naturally enough, we keep in mind first the case where 
the ground field is K = Q. 

Suppose then that we are given a normalized polynomial f € Q[X] of degree 
n= 1. First we get rid of multiple roots, by taking the gcd of f and f’ and dividing 
J by it. We can also arrange for all the coefficients of f to be in Z, by making 
a substitution ¥ +» X/c for a judiciously chosen integer c (much as in the proof 
of F7) and dividing by the leading coefficient to keep the polynomial normalized. 
These changes do not affect the Galois group G of f over Q. 

It is now natural to look at the reduction modulo some appropriate prime p. We 
cannot hope that this will still leave the Galois group unaltered — consider that only 
cyclic Galois groups occur over a finite field — but we can expect to obtain some 
partial information about G. We denote by / = f(X) the canonical image of f in 
F,[X] and we assume moreover that f has no multiple roots. Over a splitting field 
E of f over Q we have 


(8) S(X) = (X — 0) (X — a2)... (X — an). 


If A denotes the integral closure of Z in E, one can extend the natural map Z — F, 
into a homomorphism ¢ from A into an algebraic closure of F, (see F12); over this 
algebraic closure we have 


(9) f(X) = (X —&)(X —&2)... (X — an), 


with a@; = y(a;). We can then state the following law: 
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Theorem 1. We maintain the preceding notation and the assumption that ‘a has 
no multiple roots. As a group of permutations of the roots &,@2,...,Gn, of f, 
the Galois group G(f) of f over Fy is isomorphic to a subgroup of the Galois 
group G(f) of f over Q, likewise regarded as a group of permutations of the roots 
1,02,...,Qn of f. 


We postpone the proof of this theorem a bit in order to illustrate its application 
to the case of the polynomial 


(10) f(X) = X°-X-1. 
For p = 2, the prime factorization of f in FpLX] is 
(11) FCO) = (X74 X41 4 X74). 


From this we see immediately that G(/) contains a transposition (as well as a three- 
cycle). Because of Theorem 1, the Galois group G(f') over @ must also contain a 
transposition (as well as a three-cycle). 

If we now examine f modulo the prime p = 5 as well, we can in fact conclude, 
as outlined on page 190, that G(f) is isomorphic to the full symmetric group Ss. 

An important ingredient in the application of Theorem 1, of course, is that G(/) 
is isomorphic to a subgroup of G(f) not just as an abstract group, but rather with 
preservation of the permutation structure; otherwise in the p = 2 example one would 
not be able to conclude that G(f) contains a transposition, only some element of 
order 2. 


The proof of Theorem 1 will be given in a more general framework. Instead 
of Z we will consider an arbitrary integrally closed ring R with fraction field K. 
We will start from a normalized polynomial f(X) € R[X] of degree n > 1. Let E 
be a splitting field of f over K and let A be the integral closure of R in E. Also 
suppose given a maximal ideal p of R with quotient field R = R/p. We assume 
that the natural map R — R can be extended to a homomorphism 


g:A>F 


from A into an algebraic closure F of R. (This does not represent a restriction; 
for R = Z we have seen why in F12, and for the general case one would resort to 
F12*.) In general there are many ways to extend R—> R to a homomorphism from 
A into a given algebraic closure F of R, but we imagine having chosen such an 
extension once and for all, and denote it by w+ q; let its image be A and its kernel 
58. An obvious idea is to form the set 


(12) Gp ={o €G| oP =f} 


of all those elements o of the Galois group G = G(f) = G(E/K) of f over K 
that map the kernel ‘8 of ¢ into itself. Clearly Gy is a subgroup of G, and by 
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the definition of Gy each element o € Gg gives rise to an automorphism of the 
algebraic field extension A/R which is well defined by the condition 


(13) G(&) =o(a). 


Note that the extension A/R is normal, which can be seen as follows: Let B € A be 
arbitrary and take g = MiPox (B). Since the normalized polynomial g € R[X’] splits 
into linear factors over EF, one sees by applying ¢ that g also splits into linear factors 
over A. Since Z(f) = 0 the assertion follows; moreover one gets R(B):R < E: K, 
on account of which the largest separable subextension of A/R must be finite. As 
we have already seen, the map 


Gs > G(A/R) 


(14) = 
Oro 


affords a natural homomorphism between the subgroup Gs; of G and the group 
G(A/R) of the normal field extension A/R. As before, let f be the canonical 
image of the given polynomial f in R[X’]. If (8) is the factorization of f over E, 
(9) is the factorization of te over A. Since R [@1,...,@,] is contained in A, the map 
(14) yields a homomorphism 


(15) Gy > G(f) 


from Gs into the group of f over R. 

From now on we assume that ra has no multiple roots, so the @,@2,...,@» in 
(9) are all distinct. Then the map (15) is obviously injective: For if oa; = a; with 
o € Gy and j #i, we get 6a; = 6d; = a, that is, o A 1. And what’s more, we 
have the following theorem (which encompasses the statement of Theorem 1): 


Theorem 2. In the situation above, assuming that ‘a has no multiple roots, the maps 
(14) and (15) are isomorphisms. 


Proof. Let’s first assume that (14) is already known to be surjective. Because (15) 
is injective, as we have just seen, it follows that G(/) = G(A/R), and we are done. 
The proof that (14) is surjective is carried out in two steps: 


(1) Let Z be the fixed field of Gy in E and let S = AN Z be the integral closure 
of R in Z. We wish to show that S = R. For this we take elements 1,02,...,0, of 
G that give rise to the distinct K-homomorphisms from Z into EF; here r = Z: K. 
By definition, then, 8 4 0; and o, $8 A P. We claim that 


(16) a 'PNS APS 


as well. For, given x € o; |B not lying in $B, the element 


Nejz(x) = I] Ox 


ocGy 
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lies in o; |B AS, but not in $8. Since the natural map S/BNS > A/P is injective 
and §8 is a maximal ideal of A, the intersection 8M S' is a maximal ideal of S' (see 
F11). Now let s be any element of S. An application of the Chinese Remainder 
Theorem (see Section 4.5, Lemma and F16) now yields, in view of (16), an element 
z €S such that 


z=smod §, z=1 modo; ' for i =2,...,r 


Then, setting 


* 
a:= Nz/x(z) = z-|] OiZ, 
i=2 
we have obtained an element of R such that a = s mod 8. This shows that indeed 
S=R. 
(2) To prove the surjectivity of (14) we can assume from now on that Gy = G. 
Let p € G(A/R) be given. We will use the fact that p is determined by its action 
on a primitive element of the largest separable subextension of A/R. Take B, with 
B € K, to be such a primitive element. Let g be the minimal polynomial of 6 over 
K and let g(X) = (X¥ —B1)(X —B2)... (X — Bm) be its factorization over E. Since 
p can only take B to a root of g, we have pp = B. i for some 7. By the irreducibility 
of g, however, there is some o € G such that o6 = B;. There follows o = p, which 
proves the claim. 


As a consequence of Theorem | we now mention a fact first stated by Dedekind: 


F13. Let f be a normalized polynomial with coefficients in Z and let p be a prime 
number for which the polynomial f of Fy|X| determined by f has no multiple roots. 
Let the prime factorization of f in FpLX] be 


(17) PS fife 


each f; having degree n;. Regarded as a group of permutations of the roots of f , the 
Galois group G(f) of f over Q contains an element o whose decomposition into 
cycles has the form 


(18) 0 =0102...0;, with lengtho; =n;. 
(A marvelous converse was proved by Frobenius: see §11.11 in the Appendix.) 


Proof, We start from the Galois group G(/) of f over Fy. The orbits of the action of 
G( f ) on the set of roots of a3 are precisely the sets of roots of the distinct irreducible 
factors f,, fo,..., fy of f in (17). But the Galois group G(/) is cyclic, and so 
generated by a single element o. Directly from the definition (Section 15.4), we 
conclude that the cycle decomposition of & has the form 6 =0102...0; with cycles 
a; of length n; = deg f;. By Theorem 1, then, each G(/) contains a permutation 
o of the same type. 
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Remark. Under the assumptions underlying Theorem 2, we have: If 8’ is another 
prime ideal of A such that 3B'N R = p, there exists an element o in the Galois group 
G of E/K such that o% = ¥'. 


Proof. Assume for a contradiction that o 58 4 ¥’ for allo € G. Applying the Chinese 
Remainder Theorem to 8 and the ideals o~' 8’ for o € G, we get ana € A such 
that 

a =0 mod §, a=I1modo '!’ forall o€G. 


Now the norm Na = [[, oa € R satisfies on the one hand Na = 0 mod 8 and on 
the other Na = | mod $’. Since BN R= p= P'N R, this is impossible. 


To conclude, we mention an interesting consequence of Theorem 2 to Galois 
Theory: 


F14. Let k be a hilbertian field (see Remark (a) at the end of Chapter 15). Let 
a subgroup G of Sy be given. G can be regarded in a natural way as a group of 
automorphisms of the field of rational functions E = k(X,,..., Xn) inn variables 
over k; let K = E® denote the fixed field of G in E. Under the assumption that K 
is also a rational function field 


K=k(t,...,t) 


inn variables t,,...,tn over k, the group G can be realized as a Galois group over 
the field k. 

Proof. By the Primitive Element Theorem there is some a € EF such that E = K(q), 
and we can assume that @ is integral over k[t,,..., tn]. So let f = f(t,...,t1, X) 
be the minimal polynomial of @ over K. Since k is hilbertian, there exist c1,...,¢n € 


k such that f = f(c1,...,¢n, X) is irreducible and separable in k[X]. It follows 
directly from Theorem 2, together with F12*, that the Galois group G(/) of f over 
k is isomorphic to a subgroup of G = G(E/K). Because / is irreducible we then 
have |G(f)| > deg f = deg f = |G|, so G is isomorphic to G(/), proving the 
assertion. 


The relevance of F14 to inverse Galois theory over Q (page 178) stands out in 
view of the Hilbert Irreducibility Theorem (Section 15.7): A given finite group G of 
order 1 can be regarded naturally as a subgroup of S,, and thus also as a group of 
automorphisms of the field of rational functions Q(X,,..., X;,) in variables over 
Q. If the fixed field K of G is likewise a field of rational functions in n variables 
over Q, the given group G is isomorphic to the Galois group of a Galois extension 
L/Q with ground field Q. 

The conjecture that the field K so obtained always satisfies the condition just 
stated is generally attributed to Emmy Noether, although her 1917 work has no hint 
of it; nor does Hilbert’s foundational work of 1892 contain any intimation in this 
direction. In 1969 a counterexample to the conjecture was exhibited by Swan, for G 
acyclic group of order 47 (Invent. Math. 7, 148-158). Thus the central problem of 
inverse Galois theory is not to be put to rest so easily; and yet the methodical study 
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of function fields remains by all means a fruitful approach. (One may also remark 
that the counterexamples found by Swan involve only certain cyclic groups, that 
is, groups whose realizability as Galois groups over @ is known on other grounds 
anyway: see $14.10 in the Appendix.) 

The reader who wishes to get a glimpse of current work on inverse Galois theory 
is referred to Inverse Galois theory by G. Malle and B. H. Matzat’s (Springer, 1999) 
and to Generic polynomials: constructive aspects of the inverse Galois problem by 
C. Jensen, A. Ledet and N. Yui (Cambridge, 2002). 
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The Transcendence of z 


1. To prove the famous result, already stated in Chapter 2, that z is transcendent, 
we will frame it as a special case of a more general theorem that will be of use in 
other situations. Not that we shall be able to go deep into the fascinating territory 
of transcendental number theory, but this approach hopefully has the advantage of 
transparency, and in any case no shorter path to the transcendence of z is known 
to this author. The guiding ideas are taken from Drinfeld’s booklet,' with some 
necessary minor corrections to the exposition. 

Suppose that z is algebraic. Let 6; =iz, B2,..., Bm be the conjugates of iz. 
Since e!* = —1, we have 


(1 +e%)(1 + 82)... +eFm) =0, 
Multiplying out we get 
1+ > eb + chi the 4... 4 Bite thm — 0, 
J j<k 


Denote by a1, @2,...,@, those exponents Bj, B;+ Bx, ..., Bit++:+ 6m that are 
nonzero, and rewrite the preceding relation as 


(1) N +e% + e% +... + e% =0, 


where N €N and all the a; are nonzero by assumption. 
Now, a conjugacy map simply permutes the numbers 1, @2,...,@,. But then 
the existence of a relation (1) is precluded by Theorem | on the next page. 


2. As a stepping stone to Theorem | we state and prove an elementary approxima- 
tion property of the exponential function in connection with an arbitrary polynomial 


(2) S(X) = co Her X +++ emX™. 


' GL. Drinfeld, Keagpatypa kpyra 4 TpaHcleHAeHTHOCTB uucsa sz, Vishcha shkola, 
Kiev, 1976; German translation: Quadratur des Kreises und Transzendenz von x (Mathe- 
matische Schiilerbiicherei, 101), WEB Deutscher Verlag der Wissenschaften, Berlin, 1980. 
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First we claim that for every nonzero x € C and every j = 0,1,2,..., 


. . 
(3) fle = fit jixt Sx? tox) tx! tgs (ael™, 


where |qj(x)| <1. 
The questionable remainder of the series j!e* is x/*15; (x), where 


2 
x X 
1) GaGa). UDG DO 
Thus ‘ 
|x| || |x| 
6; <14+— ee i 
|6;(x)| tg aoe <e 


for x # 0. It follows that q; (x) := 6; (x) e~|*! does indeed satisfy lqj(x)| <1. 
Now multiply (3) by c;, for 7 = 0,1,...,m, and add together the resulting 
equalities. After a simple calculation, this leads to: 


Lemma. For f(X) as in (2), set 
(4) F(X) = f(X) + FOHK) +--+ £O). 


Then for every nonzero x € C we have 


(5) F(0)e* = F(x) +e Q(x), 

where 

(6) F(0) = 3 cj! 

and 

(7) Q(x) = Saataw with |qj(x)| <1. 
j=0 


3. We denote by @° the field of all algebraic numbers, that is, the algebraic closure 
of Q inC. 


Theorem 1. Let a,...,Q@, € Q@° \ {0} and a,,...,dy € Z be given, satisfying 
the following condition: For every automorphism o of Q° /Q there is a permutation 
8 € Sy such that oj = Os(;) and asi) = aj; for alli. Then there exists no nonzero 
integer a satisfying 


(8) aye"! + aze%2 +++» + ane" =a. 


Proof. The a; are roots of a polynomial g(X) = )~b; xX! € Z[X] of degree n. For 
a given prime p (which will be chosen later) we consider the polynomial f(X) 
defined by 
m 
(9) (p— DI f(X) = XP1g(X)? =: Do X/ € Z[X], 
p-1 
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where m = np + p—1. Now we form the polynomial F associated with f as in 
(4), and obtain, with the notations of the lemma, the equations 


F(0)e% = F(a) + e'™! O(a;). 
Multiplying by a; and adding together, we obtain, after using (8), 
(10) a F(0) ay F(a1) ~ ++» an F(Om) = aye!!! Ole) +++» + ane!" O(n). 
Now, it follows from the definitions that, for each 7, 
(11) F(a) = p-h(a) for some h € Z[X] of degree < np. 


(Note that a; is a root of f of multiplicity at least p, so all the derivatives of f of 
order up to p—1 vanish at a;.) 

Because the @1,...,@, are algebraic numbers, we can find b € N such that all 
numbers bak with k <n are algebraic integers. Then b?/(a;) is also an algebraic 
integer for every 7. Multiplying by 5? we get from the left-hand side of (10) an 
algebraic integer; since this number must also, by assumption, be invariant under 
all automorphisms, we have 


(12) ab? F(0) — a,b? F(a) —-+-—anb? F(an) € Z. 
Choose p large enough that 
(13) Ptabob. 


The number F(0), being equal to Se Peake /(p—1)!, must be an integer, and 
modulo p we have 
F(0) = ¢p-1 = b§ = bo mod p. 


For p as in (13), therefore, the first summand in (12) is not divisible by p. In view 
of (11), this shows that the whole sum (12) is not divisible by p, and a fortiori it 
is nonzero. We will thus have a contradiction with (10) if we show that for large 
enough p, 


(14) |a1b? O(a )el!! + +--+ anb? O(a) el"'| <1. 
By the definition of Q(x) in (7), we have 


(p-D!O@1< Ylel elt = bx] DS ley! [xl 
j=0 


j7=0 
. P . 
< |x e"(o i xt") 2 we bi xt") 


(For the last inequality, note that the c; arise from the b; by a polynomial law.) 
From this we see that, for? = 1,2,...,n, 


P 


MP 
(15) |aib? O(ox)| < QD! 
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with a constant M > 0 that depends only on the initial data, not on p. For large 
enough p, then, the right-hand side of (15) becomes as small as we please; in 
particular one can arrange for (14) to hold. 


Now, starting from Theorem 1, we would like to derive some more general 
transcendence statements. Following Weierstrass (see his Werke, vol. II), we first 


formulate the following result: 
Lemma. Suppose given algebraic numbers x1,...,Xm, all distinct. If (a); € 
C™”™ \ {0} for k =1,2,...,1r, form the product 


P=T1(e'e) = Sr aa® a Pentre, 
i J Fiveodr 


By collecting together terms having the same sums in the exponent, we obtain a 
representation 


(16) P= ce 
720 
with Zo, 21,22,... all distinct (where each c; is a sum of products a” oa sa? with 


Xj, +++: + Xj, = 2;). Then at least one of the c; in (16) is nonzero. 


Proof. Since the order of the x; is not involved, we can assume that they are ordered 
lexicographically as points in C = RY; ee > X2 >++: > Xm. If we define iz, for 
each k, as the lowest index such that at Z 0, and then set z; = x;, +-::+;,, the 
corresponding c; is nonzero; for if 

al) (r) xj, tet x; 

Aj 1.4 ON r 


is another nonzero summand, then j, = ix for each k and hence xj, +-+:+x;, < 


Xi beset ae i ane equality can only hold if 7; =i,,..., 7, =i,. Thus we see that 
(1) 0. 

Cj =a;. ate 

Theorem 2. Es ,X2,...,Xm are distinct algebraic numbers, then e*'!, e*2,...,e*™ 


are linearly independent over Q. 


Proof. We pick a finite Galois extension K/@ containing all the x;. By supple- 
menting the x;,X2,...,Xm with conjugates as needed, we can assume that every 
o €G = G(K/Q) effects a permutation of the x;; thus there is a well defined 
element of Sj, still denoted by o, such that ox; = Xi). Now suppose there is a 
nontrivial relation 


(17) Yaje*i =0, with aj € Q. 


We form the product 


(18) I] (x ae") = oe’, 
o J i=0 
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where according to the lemma some coefficient on the right-hand side, say co, is 
nonzero. Clearly each o effects a permutation of the z;, and it is easy to see that 
oz; = z; always implies c; = cj. By collecting conjugates together in (18) and 
renumbering appropriately, we obtain 


t 
(19) So bk by ee) =0, with b, €Q 
k=0 o 


and bo # 0. We next multiply (19) by 7, e~*”9, obtaining 


(20) So dk 2 os) = So de (x see?) Gg 
k O,T k 0,0 


Let H be the subgroup of all p € G such that zo = zo. Then the expression zz, —PZo 
vanishes if and only if k = 0 and p € H. Consider the nonvanishing members of 
the family (zx — eZo0)x,r for all k and p € H, and call them y1, y2,...; then (20) 
yields 


(21) r+) > n(X o) =0, 
i21 o 


with r; € Q, where rp = |G|-|H|-bo 4 0. Denoting by a1, a@2,...,@, the members 
of the familiy (s;)s,i, for all 7 and o € G, we finally get from (21) a relation 


n 
y aje“ =a 


i=1 


of precisely the form precluded by Theorem 1. 


As a consequence of Theorem 2, we obtain: 


Theorem 3 (Hermite—Lindemann). [fa 4 0 is an algebraic number, e® is transcen- 
dental. In particular, e is transcendental (and also 1, since e'™ = —1). 


Proof. If e* were algebraic, there would be a nontrivial relation 


age? +. aye" +a,e7% +---+ ane" =0 with a; €Q, 


contradicting Theorem 2. 


Theorem 3 was first proved by Lindemann, using methods developed by Hermite, 
who had already been able to use them to show the transcendence of e. 


A more general fact than Theorem 2 was stated by Lindemann and proved by 
Weierstrass: 


Theorem 4 (Lindemann—Weierstrass). If x1, ..., Xm are distinct algebraic numbers, 
e*1,e*2,...,e*™ are linearly independent over Q°. 
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Proof. Suppose 
aye"! +---+an,e*" =0, 


where @1,...,@m are nonvanishing algebraic numbers. Choose a finite Galois ex- 
tension K/Q containing all the a;. Setting G = G(K/Q) we form the product 


TH (Safe) = Dae®, 
ocG Sj i 


as in the lemma. One easily sees that each c; is invariant under all the o € G; thus 
c; € Q. In view of the lemma, this leads to a contradiction with Theorem 2. 


As an exercise, derive from Theorem 4 the following fact: If a chord of the unit 
circle has as its length a (nonzero) algebraic number, the length of the corresponding 
arc cannot be constructed with ruler and compass, and neither can the area of the 
corresponding sector. 
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Fundamentals of Transcendental Field Extensions 


1. Let E/K be a fixed field extension. Let M be a subset of E. By the algebraic 
closure of M in E we understand the algebraic closure of K(M) in E (see F8 in 
Section 2.5). We denote this field by 


H(M) = He /x(M). 


We say that an element @ in E is algebraically dependent of M (over K) if a lies 
in H(M), that is, if w algebraic is algebraic over K(M). Clearly 


(i) MC A(M); 
(i) MCM’ => H(M)C H(M)); 
(iii) H(H(M)) = H(M). 


Definition 1. We say that M is algebraically independent (over K) if 
a é H(M ~{a}) forallaeM, 


that is, if every a in M is transcendental over K(M ~ {a}). Otherwise we say that 
M is algebraically dependent (over K). 


The formal analogy between these notions and those of linear (in)dependence, 
familiar from linear algebra, is self-evident. The following statements are also clear: 


(iv) M algebraically dependent if and only if there exists some a € M that is 
algebraic over K(M ~ {a}). 
(v) An element a € M lies in H(M ~ {a}) if and only if H(M) = H(M \~ {a}). 
(vi) If @ is algebraic over K(M) and does not lie in MW, then M U {a} is alge- 
braically dependent. 
(vii) M is algebraically dependent if and only if M has a finite subset that is still 
algebraically dependent. 
(viii) M is algebraically independent if and only if every finite subset of M is 
algebraically independent. 
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Fl. M is algebraically independent if and only if, for any distinct elements 
Q1,...,Q, of M (where n is any positive integer), the canonical homomorphism 
of K-algebras from the polynomial ring K[X1,..., Xn] into E defined by 


Xirra; fori<i<n 


is injective; in other words, if and only if there is no nontrivial algebraic relation 
linking the oj (the meaning of this expression being that f(a1,...,Qn) = 0 implies 
f =0 for fe K[M,..., Xn). 


Proof. Without loss of generality, we can assume that M = {a1,...,@,} has n 
elements. 


(1) Assume that M is algebraically dependent. Then there exists i such that a; is 
algebraically dependent of M ~ {a;}; we may as well suppose it’s i =. Therefore 
Qy is algebraic over K(a1,...,Q@n—1) = Frac K[a,...,@,—1]. Thus 


m 
S nGyacGaie, =0, 
i=0 


for certain polynomials g; = gj(X1,..., Xn-1) in K[X%1,..., Xn—1], the last of 
which satisfies 2)(@1,...,@n—1) # 0. If we set 


m 
$C MSY HOG ee, 
i=0 
the polynomial fe K[X1,..., Xn] satisfies f(a1,...,@,)=0 but f(%,..., Xn) 40. 


(2) Conversely, assume instead that M = {a1,...,@n} is algebraically independent, 
and suppose that f(a1,...,@,) = 0 for some f € K[X1,..., Xn]. We must show 
that f = 0. Now, f has a representation of the form 


(1) f(Mis- Xa) = > gi ise. Mea) Xp, 
i=0 


with uniquely determined polynomials g;(X1,..., Xn—1) € K[X1,..., Xn—1]. Since 
f(a1,...,@n) = 0, we have 


m 
(2) >" gi(@1,-..,Gn-1) a4, = 0. 

i=0 
This is an algebraic equation for a, over K(a1,...,@n—1). By assumption it must 
be trivial, that is, g;(@1,...,@,-1) = 0 for all 7. By induction this implies that 


gi(X,..., Xn-1) = 0 for all i, and hence f = 0. 
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F2. Let M be a subset of E and a any element of E. 


(a) If M is algebraically independent but M U {a} is algebraically dependent, we 
have a € H(M), that is, a depends algebraically on M. 


(b) If B is a maximal algebraically independent subset of M , then M is contained 
in H(B), that is, every element of M is algebraic over K(B). 


Proof. (a) Under the assumption, there exist distinct elements a@1,...,Q@,—; and 
dyn =a in M U {a} satisfying a nontrivial algebraic relation 


(3) f(@1,...,@n) =0 with f € K[X,,..., X,] nonzero. 


Write f in the form (1); not all the g;(X1,..., X,—1) vanish. Because of (3), the 
relation (2) is satisfied. Here not all the gj(a1,...,@,—1) can vanish; for otherwise, 
because of the algebraic independence of a@1,...,@n,—1 over K, all the g; would 
vanish, by Fl. Therefore (2) represents a nontrivial algebraic equation for a, over 
K(q@1,...,Q@,-1), meaning that a, is in fact algebraic over K(a1,...,Q@,—1) and 
hence also over K(M). 


Part (b) is an immediate consequence of (a). 


Definition 2. A transcendence basis of a field extension E/K is a set B C E such 
that 


(i) E = H(B) (so the extension E/K(B) is algebraic), and 
(ii) B is algebraically independent (over K). 


F3. If B is a subset of E, the following conditions are equivalent: 
(i) B is a transcendence basis of E/K. 


(ii) If B is contained ina subset M of E such that H(M)= E, then B is amaximal 
algebraically independent subset of M. 


(iii) There exists a subset M of E such that H(M) = E and that B is a maximal 
algebraically independent subset of M. 


Proof. (i) => (i): Take a € M ~\ B. We must show that B U {a} is algebraically 
dependent. But this is clear because a € H(B) = E, according to statement (vi) 
after Definition | (with B playing the role of M). 

(li) => (iii): Take M = E. 


(iii) => (i): All we have to show is that H(B) = E. By F2(b), M is contained in 
H(B). Therefore E = H(M) C H(A(B)) = H(B). 


Theorem 1. Every field extension E/K has a transcendence basis. More precisely: 
Given a subset M of E such that E/K(M) is algebraic, and given a subset C of 
M that is algebraically independent over K, there exists a transcendence basis B of 
E/K such thatC CBCM. 
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Proof. We must enlarge C to make it a maximal algebraically independent subset 
B of M; by F3, such a set is a transcendence basis of E/K. If M is finite, the 
existence of B is clear. If M is infinite, one resorts to Zorn’s Lemma, the argument 
being wholly similar to the one used for the proof of Chapter 6, F11. 


F4. Let E/K be a field extension and M a subset of E such that E/K(M) is 
algebraic. If C is any subset of E algebraically independent over K, there exists a 
subset M’ of M disjoint from C and such that C U M'’ is a transcendence basis of 
E/K. 


Proof. By Theorem 1, there exists a transcendence basis B of E/K such that 
CCBCMUC. Now set M’:= BX C. Then M’ and C are disjoint, and their 
union B is a transcendence basis of E'/K. 


Theorem 2. Any two transcendence bases of a field extension E/K have the same 
cardinality. 


Proof. Let B and B’ be transcendence bases of E/K. If B and B’ are both infinite 
sets, the desired assertion follows easily on set-theoretical grounds (see $18.2 in the 
Appendix). 

We now prove the assertion for the more interesting case, where E/K has a 
finite transcendence basis. Let B be an n-element transcendence basis of E/K and 
C = {a1,...,@m} an m-element algebraically independent subset of E. It suffices 
to show that in these circumstances m does not exceed n. 

Assume for a contradiction that m >n. We will show by induction that for every 
integer & such that 0 < k <n there exist subsets 


(4) Bo > Bi 2-+- > By 

of B such that, for each k, 

(5) {a1,...,a@,}U By is a transcendence basis of E/K 
and 

(6) {ay,...,a,}N Bp =D. 


For k = 0 we take Bo := B. Assume the assertion is true for 0 < k <n. By F4 
there is a subset By of the set {a;,...,a,}U By, that satisfies the conditions 


(7) {a1,...,@%41}U By 1 is a transcendence basis of E/K 


(8) {ay,...,Q@n4 1} Beyy =. 


Then B+; is necessarily contained B,. Now, Bg4; and By cannot be equal, since 
otherwise By U{a1,...,a%}U{a%41} would be algebraically independent according 
to (7), yet algebraically dependent according to (5). By virtue of (4), By has at most 
n—k elements. Therefore B, is empty. Thus {a1,...,@,} is a transcendence basis 
of E/K, by (5). Because C = {a1,...,@m} is algebraically independent, it cannot 
happen that m > n. 
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Definition 3. The transcendence degree TrDeg(E/K) of a field extension E/K is 
the cardinality of any transcendence basis of E'/K. 


Definition 4. A field extension E/K is called purely transcendental if E/K has a 
transcendence basis B for which EF = K(B). 


Remarks. (1) If £/K is purely transcendental with transcendence basis B, then 
E is K-isomorphic to the fraction field of the polynomial ring K[B] in the 
variables XY € B over K. 


(2) By Theorem 1, any field extension E'/K has an intermediate field F' for which 
F/K is purely transcendental and E’/F is algebraic: 


E 


algebraic 


(9) F 


purely transcendental 
K 


Of course, F is not unique. 


Theorem 3. Let F be an intermediate field of E/K, and let B and B’ be transcen- 
dence bases for F/K and E/F, respectively. Then B( B’ = @ and BU B’ isa 
transcendence basis for E/K. In particular, 


(10) TrDeg(E/K) = TrDeg(E/F) + TrDeg(F/K). 


Proof. Suppose @ lies in both B and B’; then a, being an element of F, is algebraic 
over F(B’ \ {a}). But because a € B’, this contradicts the algebraic independence 
of B’ over F. Thus BN B= 9. 

Next we show that E/K(B U B’) is algebraic. But assumption, F/K(B) is 
algebraic; then so is the extension 


FK(B’) / K(B)K(B’) = F(B’) / K(BUB’). 


Also by assumption, E'/F(B’) is algebraic. Since algebraicness is transitive, the 
conclusion follows. 

There remains to show that BU B’ is algebraically independent over K. By F4, 
we know there is a subset B” of BU B’ such that BN B” = © and that BU B” is 
a transcendence basis for E/K. Clearly B” is contained in B’, so if we prove that 
B’ © B” we are done. Suppose there exists a € B’\ B”. Being an element of E, 
this @ is algebraic over K(B U B”) = K(B)(B”"), and so also over F(B”). Since 
B" C B’ and because of our assumption, @ is then algebraic over F(B’ ~\ {a}), 
contradicting the algebraically independence of B’ over F. 
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2. A fact that comes in handy on many occasions is that, for finitely generated 
extensions, Theorem | (and Remark 2 to Definition 4) can be sharpened: 


Theorem 4 (Noether’s Normalization Theorem). Let A be a commutative algebra 
over a field K, and suppose that A is generated (as an algebra) by finitely many 
elements X1,...,Xn, meaning that 


(11) A= K[x1,...,Xnl. 
Then there exists some m < n (possibly zero) and elements u,,...,Um of A with the 
following properties: 
(a) The subalgebra K|u1,...,Um] of A is a polynomial algebra over K with uy, 
..+;Um as indeterminates (if m = 0, by convention, K[u1,...,Um] = K). 
(b) The ring extension A/K|u1,...,Um] is integral, and therefore, by (11), it is in 
fact finite. 
Proof. We may as well assume that x1,...,X, are all distinct. We use induction 
on n. The case n = | is clear. Suppose that n > 1 and that the assertion holds 
for n—1. If the x,,...,x, satisfy no nontrivial algebraic relation over K, there is 


nothing to show; therefore we assume instead the existence of a nonzero polynomial 
in f € K[X,..., Xn] such that 


(12) PCat ke) SV, 


Let the explicit expression for f be 


(13) f= » AS Ci meee, Oe 


v=(115.-.Yn) 


Further, let (42, 43,..., 4n be natural numbers (about which we will have more to 
say later). Setting 


(14) Vi =xi-xi" for2<i<n, 


equation (12) becomes 


(15) f(X1, Vr tXP?, 06, In FH") =. 
For notational simplicity we define R= K[X1,..., Xn]. Consider, in the polynomial 
ring R[Y2,..., Y,] in m—1 variables over R, the polynomial 


F(X, 2 + XI, 2. Yn + XN). 
As a polynomial in X; over K[Y2,..., Y,], this has the form 


SS 7D, Ca ir cs +4 (xX) ; Y>, ee Yn), 
v 
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where g(X1, Y2,..., Yn) is a polynomial whose degree in Xj is less than the degree 
of the polynomial on the left — it being assumed that we arrange for the summands 
in the sum not to cancel one another. Now we take care of the [12,..., {tn. First 
define w = (1, 2,..., fn) and denote by wv = 1 vy + ove +--+ envy the usual 
inner product of jz with v = (11,...,v,). Let p be a natural number such that 


p> deg f = max{v) +--+ yn | cy FO}. 


We now choose 
PAUSE asp’): 


For distinct n-tuples v = (11,..., vy) and v’ = (vj,...,),) such that cy # 0 and 
cy #0 we have pv ¥ jv’, because v;, v; < p for all i. (The expansion of a natural 
number in powers of p is unique.) Therefore 


F(X, ¥2 + XM,...,%n + XM") = XN + A(X1,¥2,...,¥n) 


with c #0 in K and some polynomial / of degree less than N in X1. 

Dividing this equality by c and substituting x1, y2,..., ¥n for X1, Y2,..., Yn, 
we get using (15) an integrality equation for x; over K[y2,..., Yn]. In view of (14) 
and (11), then, we conclude that the ring extension 


(16) A/K|y2,..-;¥n] is integral. 


Now, by the induction hypothesis, there exist elements u1,...,Um of K[y2,..., yn] 
with the following properties: 


(17) K[uy,...,Um] is a polynomial ring in w1,...,uUm over K; 
(18) Ky2,..-, Yn] /K[u1,...,Um] is integral. 


This proves the desired assertion, because according to (16) and (18) the extension 
A/K[u1,...,Um] is also integral (see F6 in Chapter 16). 


Here is a remarkable consequence Noether’s Normalization Theorem: 


Theorem 5. Let E'/K be a field extension, and suppose E is finitely generated as a 
K-algebra. Then E/K is algebraic. 


Proof. By Theorem 4, £ has as a subring some polynomial ring over K in finitely 
many indeterminates, say F = K[u1,..., Um], with the further property that E/F is 
integral. Since E is a field, F11 in Chapter 16 says that F must also be a field. But 
this is only possible for ™ = 0, because in a polynomial ring over a field K there 
are no invertible elements outside K*. Now, m = 0 implies that E/K is integral, 
which is to say algebraic. 


Remark. Theorem 5 represents an important generalization of the fundamental 
fact, learned long ago, that an element a of a field extension over K is algebraic if 
and only if K(a@) = K[a] (Chapter 3, Fl). At the same time, a more direct proof 
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of Theorem 5 can be given by induction on the number of generators, as we now 
show. 

Suppose £ = K[x1,...,Xn], with n > 0 (the case n = 0 being trivial). Then EF = 
K(x1)[X2,...,Xn], and the induction hypothesis implies that each x; is algebraic 
over K(x 1). We set ¢ = x1. If ¢ is algebraic over K, we are done. So suppose ¢ is 
transcendental over K. Then, by F7 in Chapter 16, there is a nonzero polynomial 
h = h(t) € K[t] such that hx; is integral over K[t] for all i. For any f in E = 
K[x1,X2,...,Xy], therefore, h°f is integral over K[t] for some appropriate power h® 
of h. In particular this holds for all f in K(t). But K[t] is integrally closed in K(t), 
so every rational function f can be represented as a quotient g/h° of polynomials 
in t, where the denominator is a power of a fixed polynomial /, independent of /. 
This is impossible. 
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Hilbert’s Nullstellensatz 


1. Transcendental field extensions come up naturally in the algebraic treatment of 
geometric problems. This chapter will serve as a appetizer for the feast that awaits 
the reader who wishes to delve deeper into the field of algebraic geometry (see for 
example Hartshorne’s textbook of the same name). 


Definition 1. In this chapter we will be working with a fixed field extension C/K 
and with the polynomial ring K[X1,..., Xn] in variables over K. If M is a subset 
of K[X1,..., Xn], we set 


N(M) = {(01,...,%n) © C” | f(%1,.--,Xn) =0 for all fe M}. 


A point x = (x1,...,X,) in N(M) is called a zero of M in C”. The set N\(M) itself 

we call the the zero set of M in C”. Instead of N(M) we sometimes use the more 

precise notation Nc(M), to exhibit the dependence on the chosen extension C of 

K. The subsets V of C” of the form V = N(M) for some M C K[X1,..., Xn] are 

called affine algebraic sets of C” defined over K, or algebraic K-sets of C” for short. 
Conversely, if N is a subset of C”, we define 


I(N) ={f © K[M,...,Xn]| f1,-.-,%n) =0 for all (01,...,%n) € N}. 


If f lies in §(N) we say that f vanishes on N. Sometimes we write x (NV) instead 
of #(N) to make K explicit. 


F1. The following formal properties hold: 
(i) MCM' => N(M’)CN(M). 
(ii) Ifa = (M) is the ideal of K|X1,..., Xn] generated by M, we have N(M) = 
N (a). 
(iii) For every N € C", the set §(N) is an ideal of K[X,,..., Xn], called the ideal 
of N in K[X1,..., Xn]. 
(iv) NCN' > G(N') CSN). 
(v) M CIN(M) for every M C K[X,..., Xn]. 
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(vi) N CNI(N) for every NCC". 

(vii) For any family (aj)iexr of ideals in K[X,,...,Xn] we have (\;<,N(ai) = 
N (ier ai), where the sum )_;<, 9: of the given ideals a; is defined as the 
ideal generated by the union |), 4i- 

(viii) N(a) U N(6) = N(ab) = N(aN 6) for any ideals a, 6 of K[X,,..., Xn], where 
the product ab of a. and 6 is defined as the ideal of K[X1,..., Xn] generated by 
all products fg with f €aand g €b. 


(ix) V=NI(V) for any algebraic K-set V of C”. 
Proof. We prove (viii) and (ix), leaving the others to the reader. 
(viii) Clearly ab CaMb Ca, 6, so (i) yields 
N(a) UN(6) C N(aN b) CN (ab). 
Thus what is left to show is that N(ab) C N(a) U N(6). Let x € C” be a zero of 
ab but not of a. Then there exists f € a such that f(x) 4 0, and for every g € b 


we must have g(x) = 0, since f(x)g(x) = fg(x) = 0. Consequently x belongs to 
N(6), as desired. 


(ix) First let M be any subset of K[X1,..., Xn]. From (v) we get M C N(M), 
which yields the inclusion N.S.N(M) CN (M) because of (i); on the other hand, (vi) 
shows that N(Z) C N¥(N(M)). Thus, for every M C K[X,,..., Xn], we have 


(x) N(M) = NIN(M). 


But the algebraic K-sets of C” are precisely those subsets of the form V = N(M), 
so (x) is tantamount to (ix). 


For an arbitrary ideal a of K[X1,..., Xn], it is generally not the case that a = 
N(a); that is, a does not necessarily coincide with the ideal of its zero set in C”. 
This is just because not every ideal occurs as the ideal of some subset of C”: if 
a= $(N) we obviously have, for every natural number m, 


(1) fea > fea. 


Thus, for instance, the ideal a= (X?,..., xe) of K[X1,..., X,] is not of the form 
a= (N). 


Definition 2. Let a be an ideal of a commutative ring R. The set 
Va:={feR|im: f™ ea} 
is an ideal of R, called the radical of a. An ideal a of R is called reduced if a= Ja. 


Remark. If R is a commutative ring, one can in particular consider the radical J0 
of the zero ideal 0 := (0). By definition, this is the set of all nilpotent elements 
of R, and for this reason it is called the nilradical of R. If a is an ideal of R, its 
radical ,/a is the inverse image of the nilradical of the quotient ring R/a under the 
quotient map. 
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It follows from these definitions that ideals of subsets of C” must be reduced. 
But for arbitrary extensions C of K, condition (1) is generally not sufficient to 
ensure that an ideal a of K[X1,..., X;] is an ideal of a subset. For example, take 
K =C=R and n = 2, and consider the principal ideal 


a= (XP 4+ Xp +--+ X7) 


of R[X1,..., X;]. It is easy to see that Xf +: --+X? is irreducible in R[X1,..., Xn]. 
Hence a is a prime ideal and as such it is reduced; and yet N(a) = {0:= (0,..., 0)}, 
so #(N(a)) = £({0}) = (X%1,..., Xn). Therefore a is not of the form $(NV), since 
for any N C C” there is an equality dual to (x) above: 


(xi) I(N) = ING(N). 
If C is an algebraically closed field, however, the condition is sufficient: 


Theorem 1 (Hilbert’s Nullstellensatz, algebraic form). Let C/K be a field extension 
where C is algebraically closed. With the preceding notations, every ideal a of 
K[X1,..., Xn] satisfies 


(2) IN (a) = Ja. 


Thus N constitutes a bijection between the set of reduced ideals of K[X1,..., Xn] 
and the set of algebraic K-sets of C". The map inverse to N is §. 


“Nullstellensatz” is German for “Theorem on zeros”. We will trace Theorem 1 
back to the following fact, which has intrinsic interest as well: 


Theorem 2 (Hilbert’s Nullstellensatz, geometric form). Let K be a field and C an 
algebraically closed extension of K (for instance, an algebraic closure of K). Let an 
ideal a of the polynomial ring K[X1,..., Xn] be given. Provided that a is not all of 
K[X,,..., Xn], there exists in C" a common zero (21, ...,2Zn) of all the polynomials 
f €a: in symbols, 

Ne(a) #2. 


Proof. The assumption a 4 K[X1,..., Xn] implies (see F12 in Chapter 6) that 
K[X1,..., Xn] has a maximal ideal m that contains a. We look at the quotient 
homomorphism 


(3) K[X,..., Xn] > K[M,..., Xn] /m. 
Denote the images of X1,..., Xn by X1,..., Xn, respectively. Then 
(4) K[X,..., Xn] /m = K[x1,..., Xn], 


and because m is maximal, K[x1,..., Xn] is a field. This implies, by Theorem 5 in 
Chapter 18, that the extension K[x1,...,Xn]/K is algebraic. Thus there exists (by 
Theorem 3 in Chapter 6) a K-homomorphism 


o: K[x1,...,Xn] > C. 
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Let 27] = 0X1, 22 =OX2, ..., Zn = OXn be the images of X1,...,X», under o. In 
view of (3) and (4) we have 

f(41,.--,2n) =9 forall fem. 


In particular this equality is fulfilled for all f € a, since aC m. Thus we have found 
a point (Z1,...,2Zn) € C” where all the polynomials in a vanish. 


Now that Theorem 2 has been proved, we derive Theorem | from it using the 
so-called “Rabinovich trick”, which is no more than an adaptation of the elementary 
technique of clearing denominators: 

Let a be any ideal of K[X1,..., Xn], and let f € ¥N(a) be a polynomial of 
K[X,,..., X»] that vanishes on the zero set (a). We must show that there exists 
a natural number m such that 

f™ €a. 
To this effect we will regard K[X1,..., X,] as a subring of the polynomial ring 
K[X,,..., Xn, Xn41] in the n+ 1 variables X1,..., Xn, Xn+1, and consider the 
ideal 2 of K[X1,..., Xn, Xn41] generated by a and the element 1—X,4, /: 


(5) A= (a,1-Xyii f). 

First assume that 2 is not all of K[X1,..., Xn+1]. By Theorem 2, then, 2 has a 
Zero (Z1,.--,Zn+1) € Crk, By definition the point (Z1,..., Zn+1) € N(2l) satisfies 
(6) g(Z1,.--,2Zn) =0 forall g €a, 

(7) SF (41,.-++32Zn)Zn¢1 = 1. 

From (6) it follows that (z1,...,2.) € N(a); thus our f € #N(a) must satisfy 
f(41,..-,2Zn) = 0. But this contradicts (7), and we conclude that our assumption is 


untenable; that is, the ideal 2{ in (5) is all of K[X1,..., Xn41], so 1 € 2. Thus by 
looking at (5) we see there is a relation of the form 


(8) 1=) higi th —Xnsif), 


for certain polynomials g; € a and h;,h € K[X,..., Xn41]. We can assume from 
the beginning that f 4 0. Applying to (8) the homomorphism K[X1,..., Xn41] > 
K(X1,..., Xn) defined by the substitutions 


Xj X; forl<i<n and Xy4iP 1/f, 


we get a relation 


ED ee Crees Crea Bai crmetre oF 


Multiplying by an appropriate power f™ of f, then, we obtain 
(9) f° = Bi Bien nb Miee Xe) 
i 


for certain polynomials hi K[X1,..., Xn]. Since gj € a this implies f” € a, and 
Theorem | is proved. 
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2. We now complement the preceding discussion of algebraic sets with the follow- 
ing fundamental theorem: 


Theorem 3 (Hilbert Basis Theorem). Let K be any field. Every ideal a of the 


polynomial ring K[X1,..., Xn] is finitely generated, that is, there exist finitely many 
polynomials fi,..., fm in K[X1,..., Xn] such that 
a=(fi,..., fm). 


In particular, if C is any extension of K, any algebraic K-set V of C” is the zero set 
of a finite family of polynomials f,,..., fm in K[X1,..., Xn]: 


VEtiectwyeC” | HOt ua) =0 for 1=i =m}. 
In this situation we write V = N(fi,..-, fm): 


Theorem 3 arises directly from the following general statement, which is also 
often called the Hilbert Basis Theorem, although Hilbert himself never stated it 
explicitly: 


Theorem 4. /f R is a commutative ring with unity in which every ideal is generated 
by finitely many elements (such a ring is called Noetherian), then every ideal of the 
ring R[X] of polynomials in one variable X over R is also finitely generated. 


Proof. Let a be an ideal of R[X]. For each integer m > 0, consider the set 
Cm = {a|a is an m-th coefficient of a polynomial f € a of degree at most m}. 


Apart from 0, then, the elements of ¢, are the highest coefficients of polynomials 
of degree m contained in a. Clearly cm is an ideal in R. If f(X) is a polynomial 
of degree m in a, then Xf(X) is a polynomial of degree m+ 1 in a; we thus obtain 
a chain 


(10) co Sy Se** Sm S my Ss 


of ideals in R. Therefore the union c of all the c; is also an ideal in R. Because R is 
assumed Noetherian, c is finitely generated, so the chain (10) terminates, meaning 
that there is some 7 for which 


(11) co Sey Se# Oy Seng St SH 
By assumption, all the ideals co, ¢1,..., ¢, are finitely generated; suppose, say, that 
(12) = (ci,...,Gr) for0O<i<n 


(where we have uniformized the size r of the generating sets for 0 <i <n by, say, 
repeating generators). By the definition of the c; there is, for all 0 <i <n and 
1<j <r,a polynomial fj; € a of the form 


(13) fy = cy X' + polynomial of degree less than i, 
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with the same c;; as in (12). Now let a be the ideal of R[X] generated by all the 
Jij- We wish to show that a = a, which will prove Theorem 4. 

It is clear that a is contained in a. Let f be a polynomial of degree m in a, and 
let a be its leading coefficient: 


f(X) =ax"4+---, witha #0. 
By definition, a € ¢,. We claim that if 1 is chosen as in (11), then 
aéc; forsome i <m,n. 


If m <n this is clear: just take i = m. For m > n, on the other hand, the claim 
follows from (11). Now a, being an element of ¢; with 0 <i <n, can be written as 


A= a1Cj1 +42Cj72 +++: +arcir, With aj € R. 


Then the polynomial of defined by 
” r 
(14) f=f > Ge 
j=l 


had degree less than m, because of (13). On the other hand, the same polynomial 
lies in a, since fj; € a Ca. By induction we can therefore assume that f € a. By 
(14), then, f itself lies in a. 


3. The Hilbert Basis Theorem has some fundamental consequences for algebraic 
sets, which we briefly discuss. As before, C/K will be an arbitrary but fixed field 
extension. 


F2. Every nonempty set of algebraic K-subsets of C" has a minimal element. 
Proof. Clearly this is equivalent to saying that any descending chain 


(15) Yi 2 V2 2-2 Vm 2 Vnti1 2° 


of algebraic K-sets V; in C” terminates, that is, satisfies V, = V,1 =--- for some 
n. An application of ¥ to (15) yields the ascending chain of ideals 


(16) (Vi) S (V2) S++» S (Vin) S (Vint) S++. 


This chain we already know to be terminating, since the union of the £(V;) is 
itself an ideal of K[X1,..., Xn], and so must be finitely generated by Theorem 3. 
Applying N to (16) and keeping in mind F1(ix) we recover the chain (15), which is 
thus seen to terminate. 


Definition 3. An algebraic K-set V 4 @ in C” is called irreducible if it cannot be 
expressed as a union V = V, U V2 of algebraic K-sets V;, V2 of C” distinct from 
V. An irreducible (affine) algebraic K-set in C” is also called an (affine) K-variety 
of C”. 
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F3. An algebraic K-set V of C” is irreducible if and only if its ideal §x(V) is a 
prime ideal of K[X,,..., Xn]. 
Proof. (1) Let V be irreducible and suppose fg € J(V). Then 


V=N(I(V)) SN(g) = N(P)UN(8), 


so V = V; UV. where Vi = N(f) OV and V2 = N(g)/N V are algebraic K-sets. 
Because V is assumed irreducible we have (say) Vi = V, so V C N(f). But then 
tf € I(N(S)) € #(V). Since V is nonempty, moreover, we have 1 € J(V). 

(2) Let $(V) be a prime ideal, and suppose V = Vi U V2 with Vi, V2 algebraic 
K-sets in K”; suppose moreover that V; # V. First note that 


(17) IV) = (YU V2) = (V1) NF (V2) D (V1) $ (V2). 


Now $(Vi) 4 #(V), otherwise we would have Vj = N($(Vi)) = N(F(V)) = V,, 
contrary to the assumption V; 4 V. Thus there exists f €. (V;) such that f € (V). 
But §(V) is a prime ideal; therefore #(V2) C #(V), by (17). It follows that V C Vz 
and hence that V is irreducible. 


Remark. Affine space K" (being the zero set of the zero polynomial) is itself an 
algebraic K-set of K”. If the field K is infinite, moreover, §(K”) is the zero ideal 
and hence K” is irreducible, by F3. In particular, K” cannot, in the case of K 
infinite, be expressed as a union of finitely many proper subspaces (this a favorite 
linear algebra exercise). If K is finite, of course, K” is not irreducible. 


Note also that every algebraic K-set V of C” is also an algebraic C-set of C”. 
But clearly being irreducible as an algebraic K-set of C” does not imply being 
irreducible as an algebraic C-set of C”. 


F4. Any algebraic K-set V of C” can be written as a finite union 
(18) V=V,UV2U-:-UV, 


of irreducible algebraic K-sets V;. If we demand that V; EZ Vx fori # k in (18), this 
representation is unique up to order; the V; are called the irreducible K-components 
of V. Every K-variety W contained in V is contained in one of the V;. 


Proof. (1) Let be the set of all algebraic K-sets of C” that cannot be expressed as in 
(18). If Lis nonempty, it has a minimal element V by F2. This V is not irreducible 
by assumption, and so it is of the form V = V; UV2, where V;, V2 are algebraic sets 
strictly contained in V. But by definition V; and V2 both have representations of 
the form (18), and therefore so does V, contradicting the assumption that it belongs 
to U. 


(2) We next show the last assertion of F4. From (18) we have 
W =(WNV,)U---U(WOY,). 


Since each W 1 V; is algebraic and W is irreducible, there exists V; such that 
W=WNV;,, and thus W C J;. 
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(3) To prove uniqueness, let V = Vj U---UV, and V = W; U---U W, be two ways 
to write V as a finite union of irreducible algebraic K-sets. By (2) there exists for 
each W; some V; such that W; C V;. Likewise there exists for this V; some Wx 
such that V; C W;; thus W; C V; C Wy. The noninclusion assumption then implies 
that 7 = k and hence W; = Vj; that is, each W; coincides with some V;. Likewise 
for each Vz; there is some W; such that V; = W). The assertion follows. 


Definition 4. Let V be an algebraic K-set of C”. The K-algebra 


(19) K[V]:= K[XM,..., Xn] /I(V) 


is called the affine coordinate ring of V. Clearly K[V] can be identified with the 
ring of all functions V —> C that arise from polynomials in K[X1,..., Xn]. Being 
a homomorphic image of K[X1,..., Xn], this K-algebra has the form 


(20) K[V] = K[x1,..., Xn], 


and so is finitely generated as a K-algebra. In the sequel we will call any finitely 
generated commutative K-algebra A an affine K-algebra. If the algebraic K-set V 
of C” is irreducible—in which case we speak of an (affine) K-variety of C” — 
then K[V] is an integral domain (see F3); the fraction field 


(21) K(V) := Frac K[V] 
is then called the field of rational functions of the K-variety V . 
Remarks. (i) If C is algebraically closed, V = C” is a K-variety, and K(V) is 


the field K(X,,..., X,) of rational functions in n variables over K. 


(ii) Suppose that the affine K-algebra A = K[x1,...,Xn] is an integral domain. 
Then A, being a homomorphic image of K[X1,..., Xn], is isomorphic to 
K[X,..., Xn]/p, where p is a prime ideal. Thus, if C is algebraically closed, 
A is isomorphic to the affine coordinate algebra of the K-variety V = N(p) of 
C” (see Theorem 1). 


(iii) Again let A= K[x1,..., Xn] be an integral domain. The transcendence degree 
of A is of course defined as that of the field extension Frac(A) /K: 


TrDeg(A/K) := TrDeg(Frac(A) /K). 
Since Frac A = K(x1,...,Xn), we have TrDeg(A/K) <n. 


(iv) Any affine K-algebra is Noetherian, since it is a homomorphic image of a 
polynomial ring K[X1,..., Xn] (see Theorem 4). 


4. In this last section we examine the notion of dimension for algebraic sets. 
We continue to work with a fixed field extension C/K, and make the additional 
assumption throughout the section that C is algebraically closed. 


Definition 5. The dimension dim V of an algebraic K-set V of C” is the supremum 
of all integers m for which there is a strict chain Vo C Vj C--- C Vy of K-varieties 
of C”, all contained in V. 
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Natural as this definition appears, it is not self-evident that dim V is finite. In 
view of Theorem | and F3, however, Definition 5 suggests two related notions: 


Definition 6. Let A be any commutative ring with unity. The height h(p) of a prime 
ideal p of A is the supremum of all integers m for which there is a strict chain 
Po Chi C-::C Pm =p of prime ideals p; in A. The dimension (more precisely, 
Krull dimension) of the ring A is the supremum of the heights of all prime ideals 
of A, and is denoted by dim A. 


Even if A is assumed to be Noetherian, it is not a priori clear that every prime 
ideal p of A satisfies h(p) < oo. This nonetheless turns out to be the case; on the 
other hand, there exist Noetherian rings A such that dim A = oo (see Matsumura, 
Commutative algebra). 

Regardless of the finiteness of the numbers in question, we have: 


F5. For every algebraic K-set V of C”, 
(22) dim V = dim K[V]. 


Proof. By F3 and Theorem 1, K-varieties W contained in V are in one-to-one 
(and inclusion-reversing) correspondence with prime ideals of the polynomial ring 
K[X,,..., Xn] that contain (V). But the latter correspond exactly to the prime 
ideals of K[X1,..., Xn]/F(V) = K[V]. The assertion follows. 


Remark. It is easy to see that 
dim V = max(dim Vj, ... , dim V,) 


if V = V; U--- UV, is the decomposition of V into irreducible components given 
by F4. Therefore we will restrict our attention from now on to K-varieties V. Take 
the conceivably simplest case of the K-variety V = C!. Then K[V] = K[X] is 
the polynomial ring in one variable over K. But K[X’] is a principal ideal domain, 
so every prime ideal of K[X’] is either a maximal ideal or the zero ideal of K[X]. 
Thus the only K-varieties contained in V = C ' are (besides V itself) the root sets 
of irreducible polynomials of K[X]. Thus for V = C! we have 


dim V = dim K[V] = 1 = TrDeg(K[V]/K). 


In general the dimension of an arbitrary K-variety V can be given the following 
description, which also makes the finiteness of dim V manifest: 


Theorem 5. For any K-variety V of C”, 
(23) dim V = TrDeg(K[V]/K). 
In particular, dim V is at most n (and for instance dim C” = n). 


We refer the proof of Theorem 5 to an auxiliary result: 
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Lemma 1. Let the K-algebra A be an integral domain and assume that TrDeg x (A) 
is finite. For any nonzero prime ideal p of A we have 


(24) TrDeg x (A/p) < TrDegx (A). 
Proof. For notational simplicity set A = A/p and denote by 
g: A> A, 
XX 


the quotient map. If X1,...,X 5 are algebraically independent over K, so are 
X1,...,X s. Thus we already know that TrDegx (A) < TrDegx (A). Now assume, 
in contradiction with (24), that 


TrDeg x (A) = TrDegx (A) =: r. 


Then there exist r algebraically independent elements X;,...,X, of A. The map y 
is injective on K[x1,...,x,], So g can be extended to a homomorphism 


@: K(x1,...,x,)[A] > K(X1,...,X,)[A], 


where the K-algebras in question are regarded as subalgebras of the fraction field 
of A or A as the case may be. Let A be the K-algebra on the left. Since r = 
TrDegx (A), every element of A is algebraic over K(x1,...,Xn). Thus A is a 
field. But then @ is injective, and hence so is gy. Because p = kerg we obtain a 
contradiction with p 4 0; this proves the lemma. 


With the help of this result we now show a result which is equivalent, thanks to 
(22), to Theorem 5: 


Theorem 5’. Let A be an affine K-algebra without zero-divisors. Then 
(25) dim A = TrDeg(A/K). 


Proof. Clearly TrDeg(A/K) is finite, since A is finitely generated as a K-algebra. 
We now use induction on r := TrDeg(A/K). 


(i) We first want to show that dim A <r. Let 
0=po Cpr Cp2C---C Pm 


be a strict chain of prime ideals of A; we mush show that m <r. If r = 0 then A 
is a field, som = 0. Now let r = 1 and (avoiding triviality) m = 1. Applying the 
natural map A > A = A/p, we get a strict chain of prime ideals 


O=p1 Ch2 C++ CPim 


of A. By the induction assumption and Lemma 1 it follows that m—1<dim A<r—1, 
and hence that m <r. 
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(ii) Now we wish to show that r < dim A. For r = 0 there is nothing to show, so let 
r = 1. By Noether’s Normalization Theorem (Chapter 18, Theorem 4), A contains 
a polynomial ring R= K[y1,...,,] in r variables as a subalgebra such that A/R 
is integral. For the prime ideal p = (y,) of R there is a prime ideal 8 of A such 
that BNR =p, by §16.12 in the Appendix. Setting 


A=A/f, R=R/p, 


so that R~ K[y1,..., Yr—i], We obtain in this way a ring extension 4/R. But 
since A/R is integral, so is A/R; it follows that TrDeg(A) = TrDeg(R) = r — 1. 
By the induction assumption this implies the existence of a strict chain of prime 
ideals 


0= $1 CP. c---cP, 
in A. Denoting by 1,..., 8, the inverse images of these ideals in A, we obtain 


a strict chain of prime ideals 0 = Bo C Pi C--- C P;. This shows that indeed 
r<dimA. 


Remark. Let the situation be as above. In part (ii) of the proof we showed that 
r <dim A. If A is a field, so that dim A = 0, it follows that r = 0, that is, A/K is 
algebraic. Thus the considerations in (ii) amount to a generalization of the argument 
used in Chapter 18 to prove Theorem 5 (page 215). 


We will now supplement Theorem 5 with something sharper. But we again need 
preparatory results: 


Lemma 2. /f R is a UFD, a prime ideal p has height \ if and only if it is a principal 
ideal. 


Proof. Suppose h(p) = 1. Then p ¥ 0, and we may take a nonzero f € p. Since p 
is a prime ideal, at least one prime factor /, of f lies in p. Now (/1) is likewise a 
prime ideal, and we have 0 4 (f1) Cp. Since A(p) = 1, we get p = (/1). 
Conversely, suppose p = (f), and assume for a contradiction that h(p) 4 1. 
Then there is a prime ideal q such that 0 4 q Cp =(/). If g is a nonzero element 
of q there must be a prime factor g; of g in q. There follows (g;) C (f), and 
because gy is irreducible we must have (f') = (gi) € q. Contradiction. 


Lemma 3. Let R = K[y1,...,,] be a polynomial ring in the r indeterminates 
Vis++++ Vr over the field K. For every prime ideal p of height h(p) = 1 in R, the 
quotient ring R = R/p has transcendence degree r — 1 over K. 


Proof. By Lemma 2, p has the form p = (p), where p € k[y1,..., y+] is a noncon- 
stant polynomial. If, say, the variable y:= y, really does appear in p, we also have 
deg, p = 1. It follows that 


pn K[y1,...,¥r—-1] = 0. 


Thus the quotient map R > R = K[V,..., ¥] gives rise to an isomorphism on the 
subring K[y1,...,¥r—1]; that is, ¥1,..., ¥-—-1 are algebraically independent over 
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K. At the same time, p(V1,..., ¥r—-1, Vr) = 0 is a (nontrivial) algebraic equation 
for p, over K(V1,..., Vr—1). Putting the two facts together we conclude that indeed 


TrDeg(R/K) = TrDeg(K(¥1,...,9,)/K) =r—1. 


For the announced sharpening of Theorem 5 we also need a basic fact from 
commutative algebra, whose justification we leave for the Appendix ($19.6): 


F6* (Krull’s Descent Lemma). Let A/R be an integral extension of integral do- 
mains, and assume that R is integrally closed (in its fraction field). Suppose given 
prime ideals p and q of R with q C p, and also a prime ideal 8 of A such that 
$30 R =p. Then there is a prime ideal Q of A such that DN R=qandQC. 


Theorem 6. Let A = K[x1,..., Xn] be an affine K-algebra, and assume that A is an 
integral domain. Let 
(26) 0= Po CPi C--- C Pm 


be a nonrefinable strict chain of prime ideals in A (such a chain must exist because 
dim A < oo; see Theorem 5’). Then 


(27) m = TrDeg(A/K); 


in particular all maximal ideals of A have the same height, namely the transcendence 
degree of A over K. (We thus recover the result that dim A = TrDeg(4/X).) 


Proof. We proceed inductively. Let m = 0. Then 58, = 0 is a maximal ideal of A, 
so A is a field. Using Chapter 18, Theorem 5 or equation (25), we conclude that 
TrDeg(A/K) = 0. 

Now suppose m => 1. By the Noether Normalization Theorem, A contains a 
polynomial algebra R = K[y1,..., y+] as a subalgebra such that A/R is integral. 
Since (26) cannot be refined, we have h(58,) = 1. We claim that the prime ideal 
p1 = 5B, R also has height 1. Otherwise there is a prime ideal q C p; such that 
0Aq# 1. Krull’s Descent Lemma then provides a prime ideal Q of A such that 
OCP, and QN R=q. Since q is not the zero ideal, neither is Q. Thus (B81) = 1 
implies 8, = Q, and we get q=QNR=YPB,N R= py, contradicting q ~ p,. Now 
setting 

A=A/P, and R=R/py, 


we obtain a ring extension A/R. Since A/R is integral, so is A/ Re There follows 
TrDeg(A/K) = TrDeg(R/K). But by Lemma 3 we have TrDeg(R/K) = r —1, so 
TrDeg(A/K) =r—1. 


Passing from A to A, we get from (26) a strict and nonrefinable chain of prime 
ideals eA _ 

0= 21 CP2C---C Pm 
of length m—1 in A . By induction (on m or the transcendence degree of A) we then 
get m—1 = TrDeg(A/K) =r-—1. Therefore m =r = TrDeg(A/K), as claimed. 
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We prove an application of Theorem 6: 


F6. A K-variety V of C” has dimension n—1 if and only if it is of the form V=N(f), 
where f is an irreducible polynomial in K[X1,..., Xn]. 


Proof. (i) Since V is irreducible, its ideal p; is a prime ideal of K[X1,..., Xn] =: A. 
Now suppose dimV = n—1. Because A = A/p; we have n—1 = dimV = 
dim K[V] = dim A (see F5). Thus there exists a strict chain of prime ideals 0 = 
p; C-+-C py in A. Taking inverse images we then get a strict chain of prime ideals 
0= po Cpi C--- Ch» in A. Since dim A =n, we must have h(p;) = 1. Therefore, 
by Lemma 2, p; = (/) is a principal ideal. Thus V = N(f), with fe K[X1,..., Xn] 
irreducible. 


(ii) Suppose V = N(f), where f € K[M1,..., X,] =: A is irreducible. Set p; = (/). 
By Lemma 2, /(p;) = 1. Since dim A < oo there is then a nonrefinable strict chain 
of prime ideals 0 = po C py C--- C pm (containing the given prime ideal!). Now 
we resort to Theorem 6, according to which m =n. Passing to A = A/p; we get a 
strict and nonrefinable chain of prime ideals of length n—1 in A. Using Theorem 6 
again we conclude that n — 1 = dim A = dim K[V] = dim V, as desired. 


From Theorem 6 some consequences of a general sort can be drawn. We leave 
the simple demonstrations to the reader: 


F7. Let A be an affine K-algebra and an integral domain. For every prime ideal p 
of A, 


(28) h(p) + dim A/p = dim A. 


Moreover, if 0 = po C pi C++: Cp, = p is a strict chain of prime ideals in A, with 
last element p, and if there is no finer chain of the same description, then r = h(p). 

In the context of K-varieties, equality (28) becomes: For every prime ideal p 
of K[X,,...,Xn], the algebraic set V = Nc(p)—where, as we recall, C is an 
algebraically closed extension of K — has dimension 


(29) dim V =n—h(p). 


One immediate consequence of the first statement in F8 is that, for 1 <r <n, the 
prime ideal (X1,..., X;) of K[X1,..., Xn] has height r; the strict chain of prime 
ideals 0 C (X1) C (X1, X2) C--- C (M%,..., X,), therefore, admits no refinement. 

Here is an addendum to F7: 


F7’. For every nonzero f in K[X1,..., Xn], all the irreducible K-components of 
N(f) = Nc(/) have dimension n—1. Conversely, if V is an algebraic K-set of C” 
whose irreducible K-components all have dimension n—1, then V is a hypersurface, 
that is, it can be expressed as V = N(f) for some nonzero f in K[X1,..., Xn]. 


Proof. If V=N(f) and f = et ... fy" is the prime factorization of f, we have 
V =V,U---UV,, where the V; = N(f;) are K-varieties. By F7, each V; has 
dimension n — 1. 
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Conversely, if the irreducible K-components V;,...,V, of an algebraic K-set 
V of C” all have dimension n — 1, then by F7 each V; is of the form V; = N( fj). 
Setting V=V, U---UV, =N(fi) U---UN( fo) = N(f.... f-) we get the desired 
result. 


Remark. Let V be any algebraic K-set of C”, and suppose that fe K[X1,..., Xn] 
does not vanish on any of the irreducible K-components of V. Then 


(30) dim(V NN(f)) < dimV — 1. 


To see this, we start by assuming without loss of generality (see F4) that V is a 
K-variety. Every K-component W of VN N(/) is a proper subset of V. Thus 
the assertion follows directly from Definition 5. In Section 27.5 in volume II we 
will show, for the case K = C, that in fact dim W = dim V — 1; this represents an 
important generalization of F7’ in the case K = C. 


Finally, using (30) one can prove the following fact (see § 19.10 in the Appendix): 


F9 (Kronecker). An algebraic K-set V of C” can always be represented as the zero 
set of at mostn + | polynomials. 


As Kronecker presumably already knew, and as proved for example by U. Storch 
(“Bemerkung zu einem Satz von M. Kneser”, Archiv der Math. 23, 1972), one can 
actually replace n + 1 by n in this statement. It is an interesting — and difficult — 
question to determine under what conditions V has a representation involving ex- 
actly n—dim V polynomials. 


Appendix: 


Problems and Remarks 


References preceded by § are to this appendix. 


Chapter 1: Constructibility with Ruler and Compass 


1.1 Let K be a subfield of C and a, b elements in K*. Show equivalence between: 
(i) K(./a) = K(Vb). 
(ii) There exists c € K* with a = bc?. 


(The assumption K C C is not essential; the statement holds for any field K where 
1+140.) 


1.2 Let K be a subfield of C with K = K and let w be a complex number such that 
w? € K. It is then always that case that K(w) = K(w)? The answer is no (that is 
why in the proof of Theorem | we had to take a certain precaution). Hint: Consider 
the example K = Q(i) with w? = 1+ (and use §1.1 witha =1+i, b~-! =1—i). 


1.3 In F9, the assumption K = K cannot be dispensed with—nor can the field 
Q(M U M) in Theorem 1 by replaced by, say, @(M). One can see this most 
spectacularly as follows (while peeking at some concepts that will only be treated 
later, in Chapters 2 and 18): choose algebraically independent real numbers x, y 
and set K = Q(x +iy), E = Q(i, x, y). Then E C AK, but E/K is not algebraic, 
since otherwise by looking at transcendence degrees we would get the contradiction 
2 = TrDeg(E/Q) < TrDeg(K/Q) < 1. 


1.4 Let E/K be a finite field extension. Prove that if E: K is a prime number, 
E/K has no proper intermediate fields, and for each a € E such that a € K we 
therefore have E = K(q). 


1.5 Prove that Q(./2, /3):Q =4 and Q(/2+¥V3) = Q(V2, V3). 
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1.6 Let K be an infinite field and E an extension of degree n > 1 over K. Show 
that the quotient group E*/K™ of the multiplicative groups of E and K is infi- 
nite. (Hint: Otherwise the K-vector space E ~ K” would be a union of finitely 
many one-dimensional subspaces; in other words, the projective space P,—;(K) 
would be finite.) Remark: E*/K™ is not even finitely generated; but this is a 
much deeper result, for which see A. Brandis, “Uber die multiplikative Struktur von 
KG6rpererweiterungen”, Math. Zeitschrift 87 (1985). 


1.7 A Danish schoolbook of 1854, published in Flensburg, contains the following 
recipe for constructing the heptagon inscribed in a circle S of radius | around the 
origin: Let the circle of radius 1 centered at z; = 1 intersect S at z2 and z3. Let 
Z4 be the intersection of the line through zz and z3 with the line through 0 and z;. 
Beginning at z;, mark off the distance |z4 —z2| against the circle S, seven times in 
succession. 

Does this mean that Gauss’s statement (see F12 in chapter 5) that ert ¢ 
A{0, 1} is in error? Show that the points obtained according to the procedure above 
are the powers Z, z*,...,2! of the complex number z = s+ ; 397. It follows that 


6553627 = 65530 — 142/391. 


Chapter 2: Algebraic Extensions 


2.1 Let E/K be a field extension and let L;, L2 be intermediate fields of E/K 
with L;: K < oo. Prove that L;L2:K =[L,: K]-[L2: K] implies L; 0 L2 = K. 
(The converse does not hold; see the Example in Section 3.1.) 


2.2 Show that Q(./2, /T+i):Q = 8. Hint: For w = /1 +i we have wo = V2, 
so Q(/2, w) = Q(i, w, w). Now see §1.2. 


2.3 Let E/K be an extension. Prove that £/K is algebraic if and only if every 
subring R of E containing K is a field. 


2.4 Let R be a commutative ring with unity and K a subring of R. Prove: If K is 
a field and R has no zero-divisors, then 1 rp = 1x (in particular 1r #0, so R is an 
integral domain and a K-vector space). Show by example that 1r and 1x can be 
distinct if R has zero-divisors. 


2.5 Define a sequence (a), of real numbers a, > 0 through the recursion a; = 2, 
An+1 = fy. Then a, € AQ for every n, and 


(1) Q(on+1):Q = 2", 


so the algebraic extension AQ/Q cannot be finite. For the proof, we will show by 
induction over n that 


(2) Q(An+1):Q(An) = 2 


for every n. The initial case n = 1 follows from /2 ¢ Q. Assume that for some 
n > | equation (2) is false; then a@,4+; € Q(an) = {a+ bay | a,b € Q(ay_1)}. But 
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On41 =a+ bay implies of, = a7, , =a? + b?a7 + 2abdy = a? +b? a1 + aban. 
From the induction hypothesis we get a> + b?an—1 = 0, and since d,_; > 0 we 
obtain a = b=0. 

It’s worth mentioning that the equality in (1) can be read off in a trice from 
certain later results from Chapters 3 and 5: @,4+1 is a root of the polynomial X aD 


which is irreducible over Q. 


2.6 Let (pn)n be the sequence of prime natural numbers. Working just as in §2.5, 
prove by induction that 


Q(/Piv-.-s J Pn):@ = 2". 


In Chapter 14 we will see that this equality, and also that of §1.1, follow directly 
from Kummer theory. 


2.7 Prove that the numbers a = sin 45° and b = cos 72° are algebraic and irrational. 
(Hint: For b see Chapter 1, Example 3 after Definition 2.) 


2.8 What is the minimal polynomial of /2 + V3 over @? (See §1.5.) Consider 


the real number 
a= V¥54+2V6 


and show that 1,a,a?,a@3 form a Q-basis of Q(a), as do 1, J2, 73, 76. Is the 
polynomial X4*— 10X? + 1 irreducible in QL]? 
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3.1 Let E/K be an extension and L,, L2 intermediate fields of E/K with L;: K 
finite. Then necessarily L;L.:L2 < L,:K (Chapter 2, F11), but L; 2: Lz is not 
necessarily a factor of L,:K. Hint: Consider Ly = Q(+/2) and Lz = Q(é3 ¥/2) 
with C3 = e27#/3, 

3.2 Let E/K be a field extension and suppose a, 6 € E are algebraic over K. Set 
Jf = MiPox(q) and g = MiPox(). Prove that f is irreducible over K(f) if and 
only if g is irreducible over K(a). 

3.3 Let K be a field and f € K[X] a polynomial of degree n > 0. Using induction 


on n and Theorem 4 (Kronecker), show that there exists an extension E of K such 
that f can be expressed as a product of linear factors over E: 


T(X) = y(X — ay) (X — a2)... (X — ay). 
Therefore the subfield K(a1,...,@,) of E satisfies K(a1,...,Q@n):K <n. 


3.4 Keeping the assumptions and notations of §3.3, show that K(a1,...,@,):K is 
in fact a divisor of n!. (Hint: Use induction on n = deg f. If f = gh in K[X] with 
deg g =r and degh = n-—r, then r!(n—r)! is a divisor of n!). Show furthermore 
that if K(a1,...,@n):K = (n—1)!, then f is irreducible over K or f already has 
a root in K. 
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3.5 How does it follow from Kronecker’s Theorem that there must exist a field with 
say exactly 9 elements? 


3.6 Let K(X) be the field of rational functions in one variable over the field K. 
Prove: 
(a) If T = T(X) is an element of K(X) not contained in K, the extension 
K(X)/K(T) is finite. Hint: Let Y be an indeterminate over K(X) and let 
T be of the form T = f/g, with f,g € K[X]. Then X is a root of the 
polynomial f(Y)—7Tg(Y) € K(T)[Y], and since T ¢ K, this polynomial is 
nonzero. 
(b) K is algebraically closed in K(X), that is, the algebraic closure of K in K(X) 
is K. (Hint: Using (a), this follows from the transitivity of algebraicness.) 
(c) K(X)/K is a simple field extension possessing infinitely many intermediate 
fields. (Hint: If K(a)/K is not algebraic, w? is transcendental over K and 


K(a?) # K(a).) 
(d) If K/k is a finite field extension of degree n, so is K(X)/k(X). 
3.7 Let E/K be a field extension and take a, 6 € E*. Suppose that a” € K 
and 6” € K for certain relatively prime natural numbers m,n. Prove that aB is a 
primitive element of the extension K(a, B)/K. (Hint: there exist x, y € Z such that 
1=xm-+yn.) 
3.8 Let E be a subfield of C and Ey = ENR. Prove: 
(a) It is not always the case that F': Eo < 2. 
(b) If E = Q(é,) with &, = e?/", then Ey = Q(np) with m =&+6,! = 
fn + fn = 2c0s(2n/n), and E: Eo = 2 forn > 2. 
(c) Forn 4 1,2,3,4,6, the number ¢, + cc is irrational. (Hint: If a € Q isa 
zero of a normalized polynomial f € Z[X], then a € Z.) 
(d) @(¢7):Q = 6, so the regular heptagon cannot be constructed with ruler and 
compass. (Show that 77 is a zero of a cubic polynomial over Q.) 


3.9 Let G be an abelian group of order n (written multiplicatively), and denote by 
j(G) the product of all elements of G. Consider the subgroups G2 = {x €G| x? = 
1} and G* = {x? | x € G} of G. Prove: 

(a) “(G) = u(G2). 

(b) If G is a subgroup of the multiplicative group of a field K and —1 € G, then 

w(G) = -1 = (-1I)"?u(G?). 

Deduce that: 

(a) In the field F, we have (p—1)!=-—1. 

(8) For p A 2 the element —1 is a square in F, if and only if p = 1 mod 4. 
3.10 Let the extension E = K(a, B) of K satisfy K(a): K = p and K(B):K =q, 
with p >q both prime. Assume moreover that char K # p. Prove that E = K(a+ 8). 

Hint: If this were not so and we set h(X) = MiPox(@ + f), we would have 

h(X + B) = MiPox (a). 
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3.11 Let K be a field and, for elements a,b in K*, write a ~ b if ab is a sum of 
two squares in K. 


(a) Why is ~ an equivalence relation? 


(b) Does the analogous statement hold if one replaces 2 by some arbitrary power 
2”? (Hint: See LA II, p. 187, Problem 87.) 


(c) In the case K = R(X), prove: f ~ 1 if and only if f(x) > 0 for allx ER 
where f(x) is defined. (Hint: Look first into the case f € R[X], and take into 
account Section 4.3, Remark 2 after Definition 5.) 


3.12 The construction of the fraction field of an integral domain given in the text can 
be generalized. Let R be a commutative ring with unity, and let S be a multiplicative 
subset of R, that is, a set S containing | and for which s,t € S implies st €¢ S. On 
the set M = Rx S, form the relation ~ given by 


(x,s)~(y,t) means du €S such that (xt—ys)u =0. 


It is easy to prove that ~ is an equivalence relation.! Denote by x/s the equivalence 
class of (x, s), and let S~! R be the set of all such equivalence classes. If the classes 
x/s are added and multiplied in the usual way (prove that this is well-defined), the 
set S~!R becomes a commutative ring with unity; it is called the ring of formal 
fractions of R with denominators in S, or the localization of R relative to S. For an 
integral domain R and S = R \ {0}, we have S~! R = Frac R. Now, if we denote 
by 1: R-> S~!R the homomorphism defined by 1(x) = x/1, we have a universal 
property analogous to F7: Let« : R — B be a ring homomorphism (of commutative 
rings with unity) such that k(s) is invertible for every s € S. There exists exactly one 
ring homomorphism d.: S~! R > B such that hot=k. 

The ring A := S~!R and the homomorphism i: R > A have the following 
properties: 
(i) ¢(s) is a unit in A for every se S. 
(ii) Every element of A is of the form c(x)i(s)~!, with x € Rands¢S. 
(ili) ¢(x) = 0 if and only if there exists s € S with xs = 0. 


Because of this last property, : is not always injective, so R in general cannot be 
regarded as a subring of S~! R. But, analogously with F8, we have: If 1: R—> A isa 
ring homomorphism with properties (i)—(i1i), then A is isomorphic to the localization 
of R relative to S. Incidentally, S~! R is the zero ring if and only if 0¢ S. 


3.13. Let R be an integral domain with fraction field K. Assume R is a Bézout 
ring, that is, given a,b € R there is always some d € R with thataR+bR=adR. 
Prove that every subring A of K containing R is of the form A = S~! R, for some 
multiplicative subset S of R. Hint: Consider S = A* 1 R. 


3.14 (a) Find a subfield K of C and z € AK such that K(z):K is finite but not a 
power of 2. How is this to agree with Chapter 1, F9? (Hint: Choose K = Q(x) 
with x = /2+it, where ¢ is a transcendental real number.) 


! The proof will make it clear why we can’t just demand xt = ys in the defining condition. 
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(b) Find a subfield K of C and z € AK such that K(z)/K is not algebraic. How 
is this to agree with Chapter 2, Theorem 1? (Hint: Choose K = Q(s + it) 
with s,¢ € R, where ¢f is transcendental and s is not in the algebraic closure 
of Q(t) in C. Then x = 5 +t is not algebraic over Q(t). Prove that if ¢ is 
algebraic over Q(x), then x is algebraic over Q(t).) 


Chapter 4: Fundamentals of Divisibility 


4.1 Find the gcd of a = 17017 and b = 1114129 and write it as an integer linear 
combination of a and b. Note how much faster it is, already in this simple example, 
to use the Euclidean algorithm than to find prime factorizations. (The factorizations, 
by the way, are a= 17-13-11-7 and b = 17- 65537.) 
4.2 Let R be a UFD. Using F12, prove: 

(a) If y € R and 6 is a gcd of a1,...,@, in R, then yé is a ged of yay,..., yan. 


(b) If a1,...,@, are pairwise relatively prime elements of R and their product 
102...Q@, is an m-th power in R, each a; is associated to an m-th power 
in R. 


(c) If P is a directory of primes of R and K = Frac R, the multiplicative group 
of K satisfies KX ~ R* x Z®), 


4.3 Find the prime factorization of ¥*4 + 1 in CX], in R[X] and in QLX]. (Aint: 
In C[X] the answer is clear; the rest follows.) How about the prime factorization of 
X°+4+X +1 in Q[X], in F2[X] and in F;9[X]? Show that Y* + 4 is not irreducible 
in Q(X]. 


4.4 Show that: 


(a) The number 2 is irreducible in R = Z[—5], but not prime. (Hint: Use the 
fact that the function Na = a@ is multiplicative; why is aw € R* equivalent to 
Na = 1?) 

(b) Z[V—1], Z[V—2], Z[4(—1+ V—3)] and Z[V2] are all Euclidean domains. 
(Hint: For Z[./2] consider Na := |aa’|, where a’ = a—b V2 for a =a+bvV2.) 


4.5 The statement that Y* = X?—2 has exactly one solution (x, y) in natural 
numbers goes back as far as Fermat. Prove its truthfulness, by working in the ring 
Z[/V—2] and making use of §4.4b and §4.2b. Remark: However, the same equa- 
tion has a whole series of rational solutions (x,y), such as (129/100, 383/1000), 
(164323/1717, 66234835/1713), and so on; this is connected with the operation of 
addition on elliptic curves. 


4.6 Let R be a principal ideal domain and consider an infinite strictly decreasing 
chain of ideals in R, say J; D [2 D [3 D---. Show that ‘aes I; = (0). 


4.7 (a) Let R be a subring of an integral domain R’. Assume that R is a principal 
ideal domain and that a,b € R. Show that a gcd of a,b in R is also a gcd of 
a,b in R’. 
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(b) Let E/K be a field extension and let f, g € K[X] be polynomials over K. 
Show that if a normalized polynomial h € E[X] is the gcd of f, g in E[X], 
all the coefficients of already lie in K. 


4.8 Let R be an integral domain in which any two elements x, y € R have an Icm. 
Show that every irreducible element of R is prime. (Hint: If a{a, then az is an 
Icm of a, 7.) 


4.9 Let R be an integral domain in which any two elements have a gcd. Show that 
any two elements a, b also have an lcm (namely ab/gcd(a, b) if a,b # 0). 


4.10 Here is an elementary proof that Z is a UFD: by induction, it is clear that any 
n > | is a product of prime numbers. Now use §4.8. Prove that if 7 is the smallest 
of all common natural multiples of two given integers x, y # 0, then m is an lcm 
of x, y. (Hint: A one-time application of division with remainder.) 


4.11 Set R = Z[/10]. Show that in R every element a ¥ 0 is the product of 
irreducible elements, but R is not a unique factorization domain. (Hint: Consider 
the multiplicative function N defined as in §4.4b, and note that w € R™ if and only 
if Na=1.) 


4.12 Let R be a commutative ring with unity and S a multiplicative subset of R 
(see §3.12). Form the localization S~!R of R relative to S, with canonical map 
t: R— S~!R. If ais an ideal of R, denote by S~'a the ideal of S~! R generated 
by (a). It is easy to check that S~!'a consists of all elements of the form a/s with 
a€aands €S; moreover S~'a = (1) if and only if an S 4 @. Conversely, if 2 
is an ideal of S~! R, denote the ideal :~!(2) of R by AN R. Then a is of the form 
a=. !(Q) if and only if no element of S gives rise to a zero-divisor of R/a. Prove 
that the maps Bt YN R and p++ S~'p establish a one-to-one correspondence 
between prime ideals of S~' R and prime ideals of R that are disjoint from S. 


4.13 Let R be a commutative ring with unity and p a prime ideal of R. Then 
S := Rw p is multiplicative, in the sense of §3.12. In this case we denote the ring 
S~!R by Ry, and call it the localization of R at p. Set M = pRy = S~'p; then 
1 € IN. Every element of Ry not belonging to M is a unit of Rp, and conversely. In 
other words: each ideal 21 4 (1) of Ry is contained in 9M. In yet different words: 
IM is the unique maximal ideal of Ry. (A commutative ring with unity that has 
a unique maximal ideal is called a local ring; in this connection see Chapter 6, 
F12.) From §4.12 it follows that prime ideals in the local ring Ry are in one-to-one 
correspondence with prime ideals of R contained in p. 


4.14 Let R be a commutative ring with unity. An element f of R is called nilpotent 
if there is a natural number 7 such that f” = 0. Denote by 9 the set of nilpotent 
elements of R, called the nilradical of R. Prove that the nilradical of R is the 
intersection of all prime ideals of R. (Hint: Given f € R, consider S = { f"|nENo}. 
If f is not nilpotent, S~!R is not the zero ring and thus has a maximal ideal ; 
see Chapter 6, F12. Now apply §4.12.) 
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4.15 Let R be a commutative ring with unity 1 4 0. Show that if every principal 
ideal of R distinct from R is prime, R is a field. (Hint: Consider the principal ideals 
(0) and (a?) for all a £ 0.) 


4.16 Find an example of a noncommutative simple ring. (Hint: Consider the matrix 
group M,,(K); see LA IL, p. 179, Problem 46.) 


4.17 Let m,n be natural numbers with m|n. Show that the canonical map 
(Z/nZ)* —> (Z/mZ)* 


is surjective. (Hint for one possible solution: Look first at the case of a prime power 
n= p’ and then bring the Chinese Remainder Theorem to bear.) 


4.18 The ring Z[i], where i = /—1, is called the ring of Gaussian integers. By 
§4.4b above it is a Euclidean domain, and so has the same nice divisibility properties 
as Z. Thus it is pertinent to ask: (a) What primes p of Z remain prime in Z[i]? (b) 
How does one locate all primes in Z[i]? 

Answer: For each Gaussian prime z there exists a Z-prime p with | p. Now, 
either (p) = (zr), or there exists (p) = (71) such that (77) ¥ (77), except in the case 
p =2. The first case happens when p = 3 mod 4, and the second when p = | mod 4 
or p=2. 

Therefrom deduce Fermat’s Theorem: A prime number p # 2 can be written in 
the form p = x?+ y* with x, y integers if and only if p = 1 mod 4. (Hint: For p = 
1 mod 4, by §3.9, there is some x such that p is a divisor of x* +1 = (x +7)(x—i). 
Now, if p is prime in Z[i], it follows that p|x-+7.) 

What would one have to know in order to perform an exactly similar analysis 
of, say, the Euclidean domain Z[/2]? 


4.19 Let E be a field, and assume that f € E[X] has a decomposition f = 
fifo... f, into pairwise relatively prime factors. Show that 


E[X\/f ~ ELX]/fi x-+-* E[XV/fr- 


(This is an application of the Chinese Remainder Theorem, but it also can be proved 
easily by a direct dimension argument.) 


4.20 Let A be a ring with unity. Assume that A = A; x--- x A; is a direct product 
of (sub)rings A;. 
(a) Prove that the ideals of A are precisely those subsets of the form J; x--+ x J;, 
where each J; is an ideal of A;. This statement also holds for left ideals 
instead of two-sided ideals. 


(b) Assume each ring A; is simple and prove that the A; coincide with the minimal 


ideals of A, and therefore are uniquely determined. (An ideal J of a ring A is 
called minimal if it is minimal in the set of all nonzero ideals of A.) 


4.21 Let R be a UFD where the ideal generated by any two elements is a principal 
ideal. Prove that R is a principal ideal domain. 
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4.22 Let R= €(X) be the ring of all continuous functions on X =[0, 1] with values 
in R. Fora eX let I, be the set of f € R such that f(a) = 0. Show that the J, 
are the only maximal ideals of R. (Hint: X is compact.) Is the same true if X is 
the open interval Y = (0,1)? (Hint: This question cannot be answered if F12 in 
Chapter 6 is not accepted.) 


4.23 Let R be a UFD, and suppose there is a prime element g of R such that no 
unit e # 1 of R satisfies the congruence e = 1 mod q. Show that R has infinitely 
many prime principal ideals (p). In particular this is true if R= Z or R= K[X] 
for K a field. 

Hint (Euclid): Let pj,..., Pn be primes in R with py = q. Then 1+ pi... Pn 
has at least one prime divisor p. 


4.24 For m,n €N relatively prime, show that: 
(a) Q('V2, 73) = Q('V2 ¥3). 
(b) A nonzero element a of a field K is an mn-th power in K if and only if a is 
both an m-th power and an n-th power in K. 


4.25 Determine all maximal ideals of the following rings: 

(a) @xQ; (b) Q[X]/X74+X¥ +1; (© QLX]/X?; Wd QLX]/ xX? -3X +2. 
4.26 Let K be a field of characteristic 0, and denote by E = K(X) the field of 
rational functions in the variable Y over K. For givena,b € K consider the subfield 
F = K(X*+aX +b). Set Fo = K(X7) and show that the extensions E/F and 
E/Fo are finite (of degree 2), but if a ~ 0 the extension E/ FN Fo is not algebraic. 
(Hint: For each f € F we have f(—X —a) = f(X). Show that any f € K(X) 
satisfying f(X +a) = f(X) must be constant.) 
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5.1 Show that the following polynomials are irreducible in Q[X]: 
(a) 3X4+4+6X?—-12X +10; (b) 2X44 5X34 5X74 6X +12. 


5.2 Why is 2 not an element of Q(/5)? Why is there no extension E of R 
such that E:R = 3? 
5.3. Prove: 
(a) XY? + X +1 is the only prime polynomial of degree 2 in F 2[X]. 
(b) The polynomial f(X) = ¥*+3X3 + X?—2X +1 is irreducible in Z[X], 
therefore also in Q[X]. (Hint: First, f has no zeroes in @ —see F8. Now 
work mod 2; see the Remark after F9.) 


(c) ¥" +1 € Q[X] is irreducible for every m = 2” (see F13). 


5.4 Show that no element of the field R(X, Y) of rational functions in two variables 
over R is a square root of ¥4 + X¥7Y¥7+4+XY+4+X. 
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5.5 Show that if a1,...,@, are distinct integers, the polynomial 
f(X) = (X -a1)?(X - a2)... (X-an)? + 1 


is irreducible in Q[X]. (Hint: f is a normalized polynomial of degree 2n in Z[X] 
taking positive values at all wa € R and taking the value | at the n distinct points 
a1,...,@n. A nontrivial factorization f = gh with g,h € Q[X] normalized would 
therefore imply g =h = (X¥ —a1)...(X —an) +1.) 

5.6 Let f(X) = X"—up™, with p a prime and wu € Z relatively prime to p. Show 
that if m,n are relatively prime, f is irreducible. (Hint: At least the case u = 1 is 


clear, because then Q((%/p)”) = Q(2/P).) 
5.7 Consider f(X) = X¥*+ X¥ +1 € Q[X]. Show that: 

(i) /f is irreducible. 

(ii) In C[LX] we have f(X) = (X — z)(X — Z)(X — w)(X — Ww); the following 
relations are satisfied: z+ Z7+w+wW=0,27+ww+(z+Z)(w+w)=0, 
z7(w+W)+ww(z2+7)=—1, z7ww = 1. 

(iii) @:= 27 + ww = (2 +2)? is a zero of the polynomial Y? —4X¥ — 1. 

(iv) Although Q(z):Q = 4, the number z does not belong to AQ. 


5.8 In the situation of §3.6, choose for T € K(X) a representation T = f/g, with 
Sg relatively prime. Show that 


K(X): K(T) = max(deg /, deg g). 


(Hint: The polynomial f(Y)—Tg(Y) in §3.6 is irreducible in K[T][Y]= K[Y][T], 
therefore also in ....) 
5.9 Let E be a subfield of the field of rational functions K(X) in one variable over 
K, with E 4 K. 
(i) Why is K(X): E < oo? Consider the minimal polynomial of X over E, 
say Y" + ty-1(X)¥""! +--+ to(X). Show that, up to multiplication by an 
element of K[X], this polynomial coincides with a polynomial 


F(X, Y) = cn(X)Y" + Cn (X)Y"! +++ + c9(X) € K[XI[Y] 


that is primitive over K[X’]. 

(ii) Let ¢ be an element of E such that t € K andt = f/g with f, g € K[X] rela- 
tively prime. Show that F(X, Y) divides g(X) f(Y)— f(X)g(Y) in K[X][¥]. 
Then, using §5.8, deduce that degy F < K(X): K(t). 

(iii) In the same situation as (ii), assume that deg f, deg g < degy F and deduce that 
g(X)fW)-f(XY2(V) =aF(X, Y) with a € K*; and hence that K(t) = E. 

(iv) Using (iti), show that E'/K is purely transcendental, that is, E is itself a field 
of rational functions in one variable over K (Liiroth’s Theorem). Hint: At least 
one ¢;(X) does not lie in K, and therefore satisfies E = K(t;). 
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5.10 Let R be an integral domain and f = aX” +---+a,X +o a primitive 
polynomial in R[X]. Suppose there is a prime element z of R and polynomials 
y, Ww € R[X] such that 


S(X) =ag(X)" +2v(X), with meN. 


Let x  X be the quotient map from R onto R/z. Prove that, if @ is a prime 
polynomial in R[X] and y 4 0 mod @G, then f irreducible in R[X]. 

This generalization of Eisenstein’s irreducibility criterion goes back to T. Schéne- 
mann, who taught at the Havel Gymnasium in Berlin. He published his result (in 
Crelle’s Journal) in 1846, a couple of years before Eisenstein. 


5.11 Let f =X" 4+ an; X" !4+-+++ao be a polynomial in Z[X] with ao 4 0, and 
suppose f(X) = (X¥ —a1)...(X —an) over C. Show that / is irreducible over Z 
if @1,...,@,—1 have absolute value less than 1. 

5.12 Prove: 


(a) For every prime number p and every normalized f € Z[X] whose image if is 
irreducible in F ,[X’], the ideal (p, f) is a maximal ideal of Z|X]. (Hint: Why 
is there a homomorphism from F LXV Sf onto Z[X]/(p, f)?) 

(b) If P is a prime ideal of Z[X] that is not of the form given in (a), P is a 
principal ideal of Z[X]. If P is a principal ideal of Z[X], then P is not a 
maximal ideal. (Hint: If a nonzero g is in P, so is some prime factor of g.) 
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6.1 Show that Q( 7/2, /—3) is a splitting field of ¥3 — 2 over Q. 
6.2 Let E/K be an algebraic field extension and F an intermediate field of E/K. 
Prove: 
(i) If £/K is normal, so is E/F. 
(ii) If F: K = 2, then F/K is normal. 
Give an example where £/F and F/K are normal, but £/K is not. (Hint: See §1.2 


and keep Theorem 4 in mind; see also $6.3 below.) 


6.3 Prove that E’ = Q(/2, /I+i) is a normal closure of the degree-4 extension 
Q(/1+i)/Q. Thanks to §2.2, E’:Q = 8. 


6.4 Show that if E/K is an algebraic field extension with the property that every 
irreducible f € K[X] over £ splits into linear factors, then F is an algebraic closure 
of K. 


6.5 (a) Let L = K(qj,...,@,) be a finite-degree extension over K. Show that 
L/K is normal if and only if L is a splitting field of fi fo... fn over K, 
where fj = MiPox (qj). 
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(b) Let E/K and L/K be finite, normal field extensions. Show that there exists a 
K-homomorphism o : E — L if and only if there exist f, g € KX] satisfying 
the following conditions: g divides f, L is a splitting field of f over K and 
E is splitting field of g over K. (Consider F4.) 


6.6 (a) Let E/K be an extension and take f € K[X]. Prove the existence of the 
following isomorphism of F-algebras: 


(1) E@K[X]/f ~ E[X]/f. 


(b) Set F = Q(i). Show that the tensor product F © F of the Q-algebra F with 
itself is not a field. (Consider that F ~ Q[X]/(X? + 1).) 


6.7 Let E,;/K, E2/K be finite extensions and E an extension of both £,; and E>. 
Let EF, E2 := E,(E2) = E2(£}1) be the composite of FE; and E2 in EF. Show that 
EF, E, ~ E; ® Ez (as K-algebras) if and only if £, Fz: £; = £2: K. 


6.8 Let f, g be irreducible polynomials over the field K, both without multiple 
zeros (in an algebraic closure of K). Let L = K(a) and E = K(f) be extensions of 
K with f(a) = 0 and g(f) = 0. Show that if f = fi... fr and g = g1... gs are 
the prime factorizations of f over K(6) and of g over K(a), we have r = s, and 
after reordering we also have E[X]/f; ~ L[X]/g; for all 1 <i <r; in particular, 


(2) [K(B): K] deg f; =[K(a): K] deg gj. 


(Hint: Use $6.6 together with §4.19 and §4.20.) This result is a significant strength- 
ening of §3.2; it is due to Dedekind. Incidentally, it was also Dedekind who baptized 
what we know as fields (with the German word K@érper, literally “body’’). 


6.9 Let R be a commutative ring with unity, S a subset of R and a an ideal such 
that aN S = ©. Show that the set of ideals b of R such thataC b andbnS=@ 
has maximal elements. These elements are prime ideals when S is a multiplicative 
set (see §3.12). As an application, show that if R = €(X) is the ring from §4.22, 
every maximal ideal m = J, of R contains a prime ideal p distinct from m. 


6.10 Let A = R[X,..., Xn] be the polynomial ring in 7 variables over an integral 
domain R. Denote by Aq the set of all homogeneous polynomials of degree d (that 
is, polynomials in A where only monomials aS ... Xn" of degree d = v1 +++++q 
appear). Prove that, as an R-module, A is the direct sum of the submodules Ag, 
d ENo. We have AgAe C Age. A polynomial f € A is homogeneous of degree 
d if and only if the equation f(tX,,...,tX%n) = t4 f(X1,..., Xn) holds in the 
polynomial ring A[r]. 


6.11 In order to prove Theorem 1, which is fundamental in our context, we had to 
introduce in Section 6.3 the notion of the tensor product of K-algebras. The notion, 
familiar from linear algebra, of the tensor product of (finitely many) K-vector spaces 
is not subsumed under the definition given in Section 6.3, because for K-algebras 
we demand the existence of a unity element (and therefore a K-vector space cannot 
simply be regarded as a ring with trivial multiplication). In general, suppose K is 
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a commutative ring with unity, and let V, W be K-modules. A tensor product of 
V,W isa K-module V @ W together with a bilinear map 


am:VxW >Ve@W, 
(x,y) > x@y, 


satisfying the following condition: Given any bilinear map 6 : V x W — Z, there 
exists a unique linear map f : V ® W — Z such that f(x ® y) = B(x, y) for all 
xEeV,yeW. Prove: 
(i) Let (V ®, W, 21) and (V ®2 W, x2) be tensor products of V, W. There exists 
a unique isomorphism 4: V @1 W > V @2 W such that A(x @1 vy) =x @2y. 
(ii) Every element of V @ W is a finite sum of elements of the form x @ y. The 
following relations hold: 


(xt+x)@y=x@ytr @y, 
XO(V+yY)=x@ytx@y’, 
ax@y=x@ay=a(x®y) for~weK. 


(iii) Given K-homomorphisms f: V > V’ and g: W > W’, there exists a unique 
K-homomorphism h : V @ W > V’ @ W’ such that h(x ® y) = fx @ gy. 
This homomorphism is written h = f @ g. 

(iv) There is a canonical isomorphism f : K ® V > V such that f(a @ x) =ax. 

(v) For any family (V;);er of K-modules, there exists a canonical isomorphism 
(Dier Vi) @ W > Dies (Vi @ W) such that (7; x1) @ vy Yi @ y). 

(vi) There is a canonical isomorphism V @ W > W @V such thatx@ yh y@x 
and a canonical isomorphism (V ® W) ® Z — V ® (W ® Z) such that 
(x@y)@zZx@(y @zZ). 


6.12 Prove the existence of a tensor product (V ® W, ) for arbitrary K-modules. 
Hint: As in Section 6.3, start with the free K-module F = KM generated by the 
set M = V x W;; then form the K-submodule U generated by all elements of one 
of the forms 


(x+x',y)—(@y)-OLY), OytY)I-G.YW-OY), (ax, y)— (x, ary). 
Finally, consider the quotient module F/U. 


6.13 Let W be a free K-module, with basis (e;);ey. Deduce from §6.11(v,vi) that, 
if V is any K-module, every t € V @ W has a unique representation t = )~ j Xj @ej 
with x; € V. Thus, for every injective homomorphism V’ > V, the corresponding 
map V’@ W > V ® W is injective as well. If V, too, is free, with basis (d;)jer, 
the family (d; ® e;);,; is a basis of V @ W. Specializing to K-vector spaces V, W 
of dimensions m,n, we see that V ® W has dimension mn. 


6.14 Let A, B be K-algebras. For the moment we will write the underlying vector 
spaces of A, B as Ag, Bo. Show that Ag ® Bo can be given one and only one K- 
algebra structure in such a way that (a @b)(a’ ® b’) = aa’ ® bb’, and this algebra is 
then the tensor product of the K-algebras A, B in the sense of Section 6.3 (see F7). 
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6.15 Consider the Z-algebras Q and Z/n (:= Z/nZ). Show that Q @ Z/n = 0, 
and that Q. ® Q ~ Q and Z/n ®Z/n = Z/n (with canonical isomorphisms). 
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7.1 Let f € K[X] be irreducible and let L/K be a normal field extension. Show 
that if g, are normalized prime factors of f in L[X], there exists o in G(L/K) 
such that og =h. 

Deduce that in the prime factorization of f in L[X] all prime factors have the 
same exponent. 


7.2 Let E/K be a (not necessarily algebraic) field extension such that char K = 
p > 0, and take a € FE. Prove: 
(a) Ifa? € K but a ¢ K, then K(a)/K is purely inseparable of degree p. (Hint: 
F16, F15.) 
(b) K(a?) = K(qa) if and only if @ is algebraic and separable over K. (Hint: 
Consider F12.) 


7.3 Let f € K[X1,..., Xn] be a polynomial in 1 variables over the field K, and 
let M be a subset of K” on which f vanishes. Which of the following conditions 


imply f =0? 

(i) M is infinite; 

(ii) M = K"; 
(iii) M contains a set of the form A, x---x A,, where each A; is an infinite subset 

of K. 

Answer: In general only (iii). And the converse is false; if, for example, K = R, 
n=2and f vanishes on M = {(m, k + +) |m,k EN}, then f =0, but M does 
not satisfy (iii). 
7.4 Let F/K be an extension and suppose K is infinite. Let fi,..., f% be polyno- 
mial in F[X,..., Xp] and set Vi = {x € K” | fi(x) = 0} for each 7. Show that if 
K" =V, U---U Vx, at least one V; coincides with K” (that is, f; = 0). 


7.5 Suppose a field extension E/K satisfies E = K(a1,...,@n), where the a; 
are algebraic over K. Suppose @2,...,Q@, separable over K. Show that if K has 
infinitely many elements, EF contains an element a of the form 


(1) A= XQ, 4+---+Xya, with x; € K, 


for which E = K(q). This is the primitive element theorem, which is due to Abel. 
Hint: Using §7.4, show that there exists (x1,...,%,) € K” such that x; 4 0 and 


n n 
(2) Yalan A Ya for i =2,...,1r, 
j=l j=l 
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where 0; = 1, 02, ..., 0, are the r:=[E:K]; = E;: K distinct K-homomorphisms 
from EF into a normal closure F of E/K. Now, if (2) is satisfied and a is as in (1), 
it follows that E; = K(a); C K(q@), and therefore a2,...,a@, lie in K(a). Since 
x, #0, we then have a; € K(q@) as well. (A somewhat weaker version of the 
primitive element theorem will come up in the context of Chapter 8.) 


7.6 Let E = K(a1,...,@,) be as in §7.5, but assume that a2,...a@, are only 
separable over K(a,). Prove that there exists a € EF with E = K(q). 

Hint: Let F be the separable closure of K in E. Then E = F(a1). Now §7.5 
yields the assertion. (For the case of finite fields see Chapter 9, Theorem 2.) 


7.7 Suppose an extension E'/K of degree 2” is of the form FE = K(,/aj,..., /Gn), 
with a; € K. Prove that, if char K 4 2, then a := a, +---+ ./dy is a primitive 
element of E/K. (Hint: Otherwise there would exist o € G(E/K(a)) with o ¥ id.) 


7.8 Find a finite extension E'/K that has infinitely many intermediate fields. (Hint: 
Consider the field of rational functions E = F,(X, Y) in two variables, the subfield 
K =[,(X?,Y?), and the intermediate fields K(X +tY) fort € K.) 


7.9 Let E/K be a finite field extension. Show that if E is perfect, so is K. (Compare 
F18.) Hint: Why is E?:K? = E: K? 


7.10 Let E/K be a finite field extension. Then [FE : K] = [E: K]; [E: K];; see F17 
and the remark following it. Deduce that for any intermediate field F of E/K, 


[E: K]; =[E: F]i [F: kK]. 


7.11 Let L/K be an algebraic extension and h € L[X] a normalized polynomial. 
Prove that a necessary and sufficient condition for all the coefficients of 4 to be 
separable over K is that for every root a of / the multiplicity ordg h be divisible 
by [K(a): K]j. 

Hint: Let a be a root of h and f the normalized prime factor of / such that 
f(a) = 0. Then 


S(X) = g(XP") = T(x?" - a") = T(x -0)?", 


j=l i=l 


where p”™ =[L(qa): L]; and the a;,...,a@, are all distinct. 


Chapter 8: Galois Extensions 


8.1 (a) Solve §1.1 again, this time using Galois theory. (Hint: Consider ./a/ Vb in 
K(Ja, Vb).) 
(b) Take E = Q(/2, ¥/3). Using Galois theory, show that & = Q(/2+ 73). 
(Hint: Consider E C R.) 
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8.2 For E = Q(V2, 4/5), show: 
(i) E/@ is a Galois extension. 
(ii) G(E/Q) ~ Z/2x Z/2. List all the intermediate fields of E/Q. 


8.3. Suppose that the Galois group of an irreducible and separable polynomial f € 
K[X] is abelian. Let E be a splitting field of f over K and let a1,...,a@, be the 
roots of f in E. Show that E = K(q;) for any i, and hence that E: K = deg f. 


8.4 Let E CC bea field. Show that if E'/Q is a Galois extension and Ey = ENR, 
we must have EF: Eo < 2. Is Eo/Q always Galois if E/Q is? Prove: If E/K 
is a Galois extension, E9/Q is Galois if and only if the generator p of G(E/Eo) 
commutes with all elements of G(E/Q). 


8.5 Let K be a subfield of R and let E/K be a Galois extension of degree 4n whose 
Galois group is cyclic. Using F6 in Chapter 9, show that no element d < 0 of K is 
a square in E’. (Hint: Consider §8.4.) 


8.6 Let a1,a2,...,dn be pairwise relatively prime square-free integers of absolute 
value # 1. Show that Q(/aj,...,./dn)/Q is a Galois extension, with Galois 
group isomorphic to {-+1,—1}” ~ (Z/2)” (compare §2.6). For each 0 < k <n, 
the number of subfields of degree 2* is equal to the number of subfields of degree 
2"-k. find this number. 


8.7 Let E/K be a normal field extension, and let E's be the separable closure of K 
in E. Show that there exists a (canonical) intermediate field F of E/K such that 
FOE, =K and FE, = E; in particular, F/K is purely inseparable and E/F is 
separable. If E/K is finite, F': K is the inseparable degree of E/K. (Hint: Consider 
the fixed field of the automorphism group G(E/K) of E/K.) 


8.8 Let L/K be an algebraic extension with the property that every irreducible 
Jf € K[X] in L has at least one root (compare §6.4). Show that L is an algebraic 
closure of K. (Hint: Work in a fixed algebraic closure C of L, apply the primitive 
element theorem and use §8.7.) 


8.9 Let k be a field of characteristic p > 0 and let K =k(Y, Z) be the field of rational 
functions in two variables over k. Also let w be a root of X72? + YX? + Z € K[X] 
and consider E = K(q). Prove: 

(i) E/K is an inseparable but not purely inseparable extension of degree 2p. 


(ii) E/K has no proper intermediate field inseparable over K (see §8.7). Hint: 
You can use §7.1. 


8.10 Let E/K be a finite field extension and G a group of K-automorphisms of E. 
Show that G is finite and its order divides E : K. Moreover, |G| = E: K if and only 
if E/K is Galois and G(E/K) =G. 


8.11 Let a be a complex number satisfying w® + 3 = 0. Show that Q(a)/Q is a 
Galois extension, and determine its Galois group and all its intermediate fields. 
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8.12 Take f,) = X4—2 and fp = ¥*—2X? +2 and let E;, Ez be splitting fields 
of f,, fo, respectively, over Q. Prove: 
(i) E, = Q(i, V2), Ey = Q(V—2, /T +7), and both have degree 8 over Q. 
(ii) &, and E2 are not isomorphic, but EF, /@ and E,/@ have isomorphic Galois 
groups. 
Find all intermediate fields of F,;/Q and F2/Q. 


8.13 Determine the Galois group G of the splitting field E of ¥? + ¥ +1 over 
@. How many subfields does FE have, and what are their degrees over @? (Hint: 
Exactly one of the roots a1, @2,@3 is real. It follows that G ~ S3. Thus there are 
exactly three subfields of degree 3, namely Q(a1), Q(a2), @(a3), and exactly one 
of degree 2. By the way, the degree-2 field is @(/—31); see Section 15.5.) 


8.14 Let E be a splitting field of ¥° —2 over Q. Prove: 
Gi) £:Q=20. 
(ii) There exists precisely one intermediate field F of E/Q with E': F = 5, and 
F/Q is normal. 
(iii) /2 + €s is a primitive element of E/Q. 
(iv) The Galois group G of E/Q has elements o, t such that ord o = 5, ord t = 4, 


and tot~! =o?. 


List all intermediate fields of E/Q. (Note: Without knowledge of group theory 
some labor is involved in proving the uniqueness part of (ii), or at any rate in 
determining the intermediate fields; but see the results in Chapter 10, particularly 
Theorem | (Sylow’s Theorem), which turns the problem into a piece of cake.) 


8.15 Let E = k(X) be the field of rational functions in one variable over some 
field k. Any automorphism o of E/k is characterized by the image o(X) of X. 
Why does this (plus §5.8) immediately imply that o(X) has the form 

aX +b 


X= with ad —bc #0? 
cX +d z: 


The automorphism group of k(X)/< is thus canonically isomorphic to the projective 
linear group PGL(2,k) = GL(2,k)/k*. 


8.16 Let E = k(X) be as in §8.15. We make the identification Aut(k(X)/k) = 
PGL(2,k). Prove that finite subgroups G of PGL(2,/) are in one-to-one corre- 
spondence with intermediate fields K such that E/K is Galois, and in fact the 
correspondence is given by Gt» E© and K+ G(E/K). 

Prove that if G is a finite subgroup of PGL(2, x), there exists a rational function 
f €k(X) such that G contains precisely those o € PGL(2, k) that leave f invariant: 


(1) G ={0 €PGL(2,k) |of = f}. 


Hint: Ltiroth’s Theorem (§5.9). 
Here is an example: Take o;(X)=X, 02(X)= X71, 03(X)=1-X, o4(X)= 
(1—X)7!, o5(X¥)=(X¥-1)X~!, o6(X) =X(X-1)~!. These elements of PGL(2, k) 
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form a group G isomorphic to S3. Find an f/f satisfying (1). (Hint: One of the 
coefficients of the minimal polynomial of X over the fixed field of G is not constant; 
see §5.9.) 


8.17 In the field of rational functions E = k(X), consider the element 


(x?-xX +1) 


OS aT 


Show that the extension k(X)/k(J/) is Galois of degree 6, and determine its Galois 
group and all its intermediate fields. 

Watch out: The group defined by (1) is finite for any nonconstant f € k(X), 
but k(X)/k(f) is generally not normal. 


8.18 Set G = Aut(k(X)/k). Prove: 
(a) If k is an infinite field, k is the fixed field of G. 


(b) In the case of a finite field k with gq elements, however, the fixed field K of G 
is not the same as k; the extension k(X)/K is Galois of order g? — g. Hence 
there exists a nonconstant f € k(X) with denominator of degree g* — q and 
such that K = k(f). Using results from Section 9.1 one can show that 


_ (Xt xX)! 


1 Oye 


satisfies this property. 


Hint: Note that G is generated by the elements aX, X¥ + b and 1/X, witha e k* 
and bek. 


8.19 Let K be a subfield of R and assume f € K[X’] is separable and normalized. 
Show that the discriminant D(f) has sign (—1)’2, where rz is half the number of 
nonreal roots of f in C. 


8.20 Prove that the only homomorphism of the field R into itself is the identity. 


8.21 Let E/K be a finite Galois extension with group G, and let V be an n- 
dimensional vector space over E on which G operates semilinearly, meaning that 
o(x + y) = a(x) +0(y) and o(Ax) = o(A)a(x) for any x,y € V, A € E and 
o €G. Clearly the set V% of elements fixed by G is a K-vector space. Prove that 
there exists a K-basis of V© that is also an E-basis of V. 

(Hint: Show first that any v1,...,Um € V® that are linearly independent over 
K are also linearly independent over E. Then show that any linear functional on 
the E-vector space V that vanishes on V@ is trivial.) 


8.22 Let F' be an intermediate field of the Galois extension L/K. Set G= G(L/K) 
and H = G(L/F). Why is it that [F: K] = G: H, if L/K is assumed finite? Prove, 
more generally, that if [F': K] is finite, so is G: H, and then [F: K] = G:H. 
(Hint: There is a well defined from G/H into the set G(F/K,L/K) of all K- 
homomorphisms of F in L.) 

Prove that G: H < oo implies [F: K] < co. 
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8.23 Let a € C be an algebraic number and K C C a field. Why is K(a): K no 
greater than Q(a):Q? Prove that if Q(a@)/Q is normal, K(a): K divides Q(a):Q. 
(Hint: Why is K(a)/K Galois and why does every o € G(K(a)/K) give rise to 
a 00 € G(Q(a)/Q)? Considerations of this sort lead to the so-called Translation 
Theorem of Galois theory; see Chapter 12, Theorem 1.) 
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9.1 Keeping the notation of Theorem 1, prove that Fpm is a subfield of Fp» if and 
only if 7 is divisible by m. 


9.2 Let K be a finite field with g elements and let f € K[X] be irreducible. Prove 
that f divides X27" — X in K[X] if and only if n is divisible by deg /. 


9.3 Let p # 2 be prime. 


(a) Using Theorem 2, prove again (compare §3.9) that —1 is a square in F, if and 
only if p = 1 mod 4. 


(b) More generally, a € FS is a square if and only if a@?~)/? = 1, 


(c) Characterize finite fields F,» where —1 is a square. 


9.4 Take G = (Z/23)*. 
(a) Find all elements of order 11 in G. (Hint: ord 2 = 11.) 
(b) Find all generators of G. (Hint: ord 5=22.) 
(c) Show that the splitting field of X ?3_ 1 over F> has degree 11 (see F11). 


9.5 In his memoir “Sur la théorie of the nombres”, Galois (1811-1832) was the 
first to consider finite fields that are proper extensions of their prime fields. Actually 
he says nothing about their existence. He simply performs computations in them, 
calling the quantities that he manipulates “les imaginaires”. Although Galois himself 
does not offer any specifically number-theoretical applications of his investigations, 
he is visibly convinced of their usefulness. 

In his memoir Galois discusses the following example: The polynomial X 3—2 is 
irreducible over F 7; thus, if we fix one of its roots, calling ita = aos we have found 
a primitive element of F73/F 7, since F73 = F7(a@). Now try to find a primitive root 
¢=aa*+ba+c of F 3, that is, an element ¢ such that ord € = 77-1 = 342 = 19-3?.2. 
Clearly q@ itself is only a primitive 9th root of unity. Next try ¢ =a@+ 1. Galois 
chooses w? + a as a primitive root and states that (w + 1)!° =—1. Has he made a 
mistake? 


9.6 Determine the prime factorization of {() = X°>— X¥*—6X?+6X7-3X +3 
as a polynomial over Q, F3, and Fs (one at a time). Show that the respective Galois 
groups all have distinct orders. 


9.7 For any k €N let ¢; denote a primitive k-th root of unity in C. Show that, for 
any natural numbers m,n with d as their gcd and v as their lcm, we have: 
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(i) g(mn)g(d) = g(m)g(n)d. 
(ii) p(m)g(n) = o(v) g(a). 
(iii) Q(Em, fn) = Q(by). 
(iv) Qn) 1 Qn) = Qa). 
(Hint: See Theorem 3 and use §2.1.) 
9.8 Prove that cyclotomic polynomials F;,, form > 1 have the following properties: 
G) Fom(X) = Fin(—X) if m is odd. 
(ii) Fin(X?) = Fin(X) Fimp(X) for all primes p not dividing m. 
(iii) Fy(X) = Fin(X"/”) if m is the product of all primes dividing n. 
(iv) Fy(0) = 1 and hence F,(X¥~!)X9™ = F,(X). 
(v) F, (1) = p if n is the power of a prime p. 
(vi) F,(1) = | if 7 is not a prime power. 
The last two properties are particularly interesting in connection with arithmetic: 
In case (vi), 1—¢, is a unit in the ring R = Z[¢,], whereas in (v) the element x = 1—¢, 
is prime in R and p has in R the prime factorization p = ex?™, with ¢ a unit (this 
despite the fact that Z[¢,] is generally not a UFD; more precisely, Z[¢,] for p prime 
is not a UFD exactly when p > 23). 
9.9 Prove, with the greatest possible economy: 
(a) Fis(X¥) = X8— X74 X59 - X44 X3-X 41. 


(b) For primes p  q, all coefficients of F,,(X) have absolute value 1. Hint: 
Look at 


Fpyq(x) = (1—x)(Fq(x?)(1 — x2)") 
as an identity between power series. 


(c) The smallest n for which not all coefficients of F,, have absolute value | is 
n = 105. In fact, 


Figs OS eX a a eh 


Note: I. Schur showed that the coefficients of the F, can be arbitrarily large. 


9.10 Let p be a prime and suppose that for some a € Z and v € N we have 
a = 1mod p” but a 4 1 mod p’*!. Show that, apart from the case p = 2 and 
v = 1, there follows 


a? =1 mod p’t!, a? £1 mod p’*?. 
Thus, for p #2 and anyn €N, the residue class of 1 + p in (Z/p"Z)* has order 


p"|; for p = 2 the equality 1 + 2? = 5 determines an element of order 2"~? in 
(Z/2Z)*. 
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9.11 Prove: 

(a) If p is an odd prime andn EN, the group (Z/ p"Z)* is cyclic of order y(p”) = 
(p —1)p""!. (Hint: Let a represent a primitive root in (Z/pZ)*. We know 
that a?~! = 1 mod p. By taking p-th powers, a can (thanks to §9.10) be 
modified so that a?~! = 1 mod p”. Then the residue class of a(1 + p) has 
order (p— 1) p”—!. 

(b) For p = 2 andn > 3 the group (Z/2"Z)* = (—1) x (5) is the direct product 
of the cyclic subgroups generated by the residue classes of —\ and 5 (of order 
2 and 2”-? respectively). (See §10.8(b) for the notion of the direct product of 
groups.) 


9.12 Consider the abelian group G = (Z/mZ)*, where m > 1. Describe the 
subgroup G2 consisting of elements o such that 0” = 1. Show, in particular, that if 
r is the number of odd primes in the prime factorization of m and e is the exponent 
of the factor 2, then Gz has order 2”, 2’+!, or 2” +2, depending on whether e < 1, 
e = 2, or e = 3, respectively. (In the case m = 2”, with n > 3, write out the three 
elements of order 2.) For what values of m is G cyclic? (Answer: Only when 
m= p" orm=2p", with p prime and n > 1, where moreover n = 1 if p = 2.) 


9.13 What are the prime factorizations of Y? — X and X¥?7 — X in F3[X]? 


9.14 Let K be the field with 729 elements. How many subfields does K possess? 
Prove that K has exactly 696 elements @ with the property that K = F3(q), and that 
there are exactly 116 normalized prime polynomials of degree 6 over F3. Formulate 
more-general statements of this sort. 


9.15 Let K be a finite field. Show that every element of K is a sum of two squares 
in K. More generally, if a,b € K*, any c € K can be represented as c = ax? +by? 
in K. Hint: How many elements are in the sets {ax? | x € K} and {c—by*| ye K}? 


9.16 Prove that any root of unity contained in Q(¢,) has the form EE, 


9.17 Take n € N and a prime p not dividing n. Show that, for x € Z, we have 
F,(x) = 0 mod p if and only if ¥ € F has order n. (Hint: X" — 1 is a separable 
polynomial in F,[X’].) 


9.18 Take n € N. Show that there are infinitely many primes p such that p = 
1 mod n. (More generally, the famous Dirichlet Theorem says that for any integer 
a relatively prime to n there are infinitely many primes p = a mod n.) Hint: Let 
P(F,) be the set of all primes p such that the equation F,(X) = 0 mod p has a 
solution in Z. From §9.17 it follows, for every prime p{n, that 


pEePUh) — p=I1modn. 


Now the assertion follows from the general result given in the next exercise. 


9.19 Let f(X) =anX"+---+a1X +dao be a nonconstant polynomial in Z[X]. Show 
that there are infinitely many primes p such that the congruence f(X) = 0 mod p 
has a solution in Z. (Hint: If ao = 1, the result follows using the same sort of 


252 Appendix: Problems and Remarks 


argument as in Euclid’s proof that there are infinitely many primes. The general 
case can then be reduced to the case dp = 1.) 
Note: If P(/) denotes the set of primes p in question, it can actually be proved 


that 
1 
eos 

peP(f) ? 
Unfortunately the author is not aware of an elementary proof of this fact, in spite 
of some stabs at the problem. 
9.20 Why is /5 not in Q(£25)? (Hint: @(4/5)/Q is not normal.) Why is 15 not 
in Q(é15)? (Hint: /—3 € Q(f3), V5 € Q(é5).) Prove more precisely that Q(¢15)/@ 
has exactly 6 proper intermediate fields, namely Q(/—3), Q(V/5), Q(V—15), 
Q(V—3, V5), Q(és), and Q(6,5 + 73). 


9.21 Show that Q(ég)/@ has exactly three intermediate fields, namely Q(/2), 
Q(V-1), and Q(/—2). 


9.22 Why does Q(¢41)/@ have exactly 8 intermediate fields? Show, more generally, 
that if g is an odd prime and DF ... pr" is the prime factorization of g—1, then Q(¢,) 
has exactly (e;+1)...(e-+1) subfields. (Hint: Theorem 3, Theorem 2, F6.) 


9.23 Take E, = K(é,6) and E, = K(¢17), with K = Q. Show that the degree 
of £,/K is half as much as the degree of £2/K, but E,/K has more proper 
intermediate fields than F'./K (in fact, twice as many). 


9.24 Examine again the assertions of §3.8 in light of recently acquired knowledge 
and show how easily they can be proved now. 
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10.1 Let G be a nontrivial group, i.e., a group having more than one element. Prove 
the equivalence between: 


(i) G is finite of prime order. 

(ii) G has no subgroups apart from itself and 1. 
(ili) G is cyclic of prime order. 
10.2 Let G be a nontrivial group. Prove the equivalence between: 

(i) There is a proper subgroup of G containing all others. 

(ii) G is a finite cyclic group and its order is a prime power. 
(iii) G is finite and the set of subgroups of G is totally ordered. 
10.3 Let & be a field with algebraic closure C. Let a be a element of C \k. Prove 
that among the intermediate fields of C/k not containing a, there is a maximal one. 
Let K be such a maximal field and E/K a finite extension (inside C/A). Prove that 
if E/K is separable, E/K is Galois with a cyclic group of order equal to a prime 


power. (Hint: §10.2.) If E/K is not separable and p = char K, then E'/K is purely 
inseparable, and the extension K/K? has degree p. (Hint: See §8.7.) 
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10.4 Let G be a finite group and H a subgroup of G. Prove that if G: H = 2, then 
His normal in G and lies in the center of G. 
More generally, let p be the smallest prime dividing the order of G. Prove: 


(a) If H is a normal subgroup of order p, then H C ZG. 
(b) If H has index p, then H is a normal subgroup of G. 


(Hint: (a) Inner automorphisms of G give rise to elements of the group Aut 7 ~ 
(Z/pZ)*. (b) Let M be the set of subgroups of G conjugate to H. By letting G 
act on M by conjugacy, one gets a homomorphism of G into the group S(/) of 
permutations of M; look at its kernel.) 


10.5 Let G be a finite p-group and H a proper subgroup of G. Show that H is a 
proper subgroup of its normalizer Ng H in G. Consequences: If H is a maximal 
subgroup of G (that is, maximal among proper subgroups of G), then H is a normal 
subgroup of index p in G. Further: Every proper subgroup # of G lies in a normal 
subgroup NV of index p in G. (Hint: ZG # 1. Work by induction on |G|. The case 
ZG €& H is trivial.) 
10.6 Let G be a finite p-group and N ¥ 1 a normal subgroup of G. Prove: 

(a) NON ZGA1. 

(b) N contains a subgroup H normal in G and such that N: H = p. 


(Hint: Use the orbit formula for the action of G on N by inner automorphisms. Part 
(b) follows from (a) by induction.) 


10.7 Let N be a normal subgroup of a group G such that the quotient G/N is 
cyclic. Prove: 
(a) If in addition N is contained in the center of G, then G is abelian. Conse- 
quence: Every group of order p” (for p prime) is abelian. 


(b) If G is finite and G/N has order f, then G has a cyclic subgroup C of order /. 
If in addition f is relatively prime to the order of N, then NMC = 1 and 
G=NC. 


10.8 (a) Let H be a subgroup of a group G. Denote by p : G > G/H the map 
defined by o +> oH. Can a multiplication operation be defined on G := G/H 
in such a way that p(o)- p(t) = p(ort) for all o, t € G? If so, there is clearly 
only one way to do it; then G/H is a group with this operation, and p is a 
group homomorphism. Prove that the answer to the question is yes if and only 
if H is anormal subgroup of G. 

Let Hi, Hy be subgroups of a group G and let f : H; x H2 > G be the map 
(X1,X2) > X1X2. Show that f is a homomorphism if and only if x1 x2 = x2x1 
for all x; € Hy, x2 € Ab. In this situation we say that G is the direct product 
of H, and Hp if f is an isomorphism. Show that this happens if and only if 
the following conditions are satisfied: 


G= Hf, A, <G fori = 1,2, A,O A, =1. 


(b 


ma 


(Hint: Two elements a, b of a group commute if and only if their commutator 
[a,b] := aba~'b~ equals 1.) 
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(c) Let H1,..., Hy, be subgroups of a group G, and let H be their intersection. 
Show that if every H; has finite index in G, so does H. (Hint: The maps 
Di: G > G/H; defined by pi(o) = oH; give rise to a well defined map 
G/H > G/H, x---x G/An.) 


10.9 Let p be a prime number. Prove: 


(a) Any group of order p*m with m < p has anormal subgroup of order p*%. (Hint: 
The case m = p is taken care of by F8. For m < p see Sylow’s Theorems.) 


(b) If G is a group of order pq, where q < p is also prime and does not divide 
p—1, then G is cyclic. 

(c) If g is a prime distinct form p, every group of order p*gq has a normal Sylow 
p-subgroup or a normal Sylow q-subgroup. 


(d) Every group of order 2”-3 possesses a normal subgroup N with G: N = 3 or 
G:N = 2. (Hint: The action of G on M = Syl,G via inner automorphisms 
yields a homomorphism G > S(M).) 


(e) Every group of order 45 is abelian. 
(f) If g and r are primes with p < g <r and G is a group of order pqr, then 


one of the nontrivial Sylow subgroups of G is normal in G. (Hint: Otherwise 
there would be too many elements of order p, g and r all combined.) 


10.10 A subgroup H of a group G is called characteristic in G, and we write 
H <4G, if g(H) = @ for all automorphisms g of G. A group G # 1 is called 
simple if it has no nontrivial proper subgroup H with H < G, and it is called 
characteristic-simple if it has no nontrivial proper subgroup H with H <1G. Prove: 

(a) The condition H < N <G implies H 3G. In particular, any subgroup H of 
a cyclic normal subgroup N of G is normal in G. 

(b) If G is acharacteristic-simple finite group and N is a minimal normal subgroup 
of G (that is, minimal among nontrivial normal subgroups of G), then G~ N x 
---x N and N is simple. (Hint: G =(g(N) |g € AutG) = 91(N)...¢,(N), 
and for r minimal the product is direct.) 


(c) If Gx N x:--x WN and N is a simple group, G is characteristic-simple. 


10.11 Among groups G of order n < 60 there are no simple groups apart from 
cyclic groups of prime order. 

(Hint: By §10.9 the only cases that remain doubtful are n = 36, n = 40 and 
n = 56. For n = 40 a normal Sylow 5-group is available, for m = 36 you can argue 
as in §10.9(d), and for n = 56 you can count elements of order 7 and those of order 
a power of 2.) 


10.12 Let E be a splitting field of ¥’ — 6 over Q. Prove: 
Gi) £:Q=6-7. 
(ii) There is exactly one intermediate field F of E'/Q such that E : F =7, namely 
F = Q(67). 
(iii) 6+ C7 is a primitive element of E/Q. 
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(iv) The Galois group G of E/Q contains elements o,t such that ordo = 7, 
ord t= 6 and tot~! =o?. 

Let’s get an overview of all the intermediate fields of E/Q. If a,,..., a7 are the 
roots of X7 — 6, then besides @ and E one necessarily has the following pairwise 
distinct intermediate fields: Q(V/—7), Q(é7 + £7), Q(é7), Q(az), Q(ax, V—7), 
and Q(a,z,€7 +7), with 1 < k <7 in each case. Show that there are no others. 
(Hint: The number of Sylow 3-groups is either 1 or 7, and since the Q(a;, /—7) are 
distinct it must be 7. Note further that every subgroup of order 6 must be contained 
in one of these seven Sylow 3-groups. Correspondingly, a subgroup of order 14 
or 21 contains the only Sylow 7-group of G. How much room is left now for the 
Sylow 2-groups? The number of elements of order 2 can also be read off from the 
structure of G.) 


10.13 Let p* be a prime power and m any natural number. For every group G of 
order mp*, consider the set M of all subsets of G having p* elements. G acts on 
M by translation; for X €¢ M let Gy = {0 € G | oX = X} be the corresponding 
stabilizer. Prove: 


(a) For every ¥ € M we have |Gy| < |X| = p*, and equality holds if and only 
if X is a coset of a subgroup H of order p*. 


(b) Denote by ng =ng(p*) the number of subgroups of order p* in G. Then the 
orbit formula for the action of G on M yields the congruence 


k 
(" ) =ng-:mmod mp. 


This holds for all groups G of order mp*. In particular, if one takes G cyclic, 


we get 
k 
(" ) =m mod mp. 


Thus one reaches the following theorem of Frobenius: Jn any group of order 
p*m the number of subgroups of order p* is congruent to 1 modulo p. In particular, 
any finite group has Sylow p-subgroups, and their number is congruent to | modulo 


This chain of reasoning goes back to Wielandt (1959) and Miller (1915), but in 
those papers there is no reference to Frobenius’s result. 


10.14 Let P be a Sylow p-subgroup of a finite group G. Show, without using 
Sylow’s Second Theorem, that if P is normal in G it is the only Sylow p-subgroup 
of G (and conversely). More generally: If N is an arbitrary normal subgroup of G, 
the intersection N M P is a Sylow p-subgroup of N (and incidentally PN/N is a 
Sylow p-subgroup of G/N). 


10.15 Let N be a normal subgroup of a finite group G. Show that, if P is a Sylow 
p-subgroup of N, then Ng(P)N =G. 


10.16 Let P be a Sylow p-subgroup of a finite group G. Show that if H is a 
subgroup of G containing Ng P, then Ng H = H. 
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10.17 Let G be a finite group. Prove the existence of a Sylow p-subgroup of G by 
induction, using the class formula F6. (Hint: Why can one start by assuming that 
|ZG| #0 mod p?) 


10.18 Let G be a p-group of order p”. Deduce from §10.13 that, for any k <n, 
the number of normal subgroups of order p* in G is congruent to 1 modulo p. 


10.19 A finite group G is called nilpotent if all Sylow subgroups of G are normal. 
Show that for a finite group G there is equivalence between: 
(i) G is nilpotent. 
(ii) G is the direct product of its Sylow subgroups. 
(iii) Any proper subgroup H of G is a proper subgroup of Ng H. 
(iv) Any maximal subgroup of G is normal (of prime index). 
(v) Any nontrivial quotient group of G has nontrivial center. 
(vi) Any two elements of G whose orders are relatively prime commute. 
10.20 Let a finite group G act on a finite set M. For any o € G, denote by i(c) 


the number of fixed points of 0. Show that the average value of i(0) coincides with 
the number s of orbits: ‘ 
s=— i(o). 
Tes 


ocG 


Hint: Look at the subset {(0, x) | ox = x} of G x M and count its elements. 


10.21 Prove that a finite p-group G # | that has only one subgroup Ho of index p 
must be cyclic. (Hint: Every proper subgroup of G lies in a maximal subgroup 
of G. Now §10.19 implies H = Ho. The statement then follows using §10.2.) 


10.22 Prove that a finite group G is never the union of the conjugates tHt~! of a 
proper subgroup H <G. 

Simple and easy to prove as this fact is, it nonetheless plays a certain role on 
many different occasions. One can also stress its connection with §10.20 and recast 
it as follows: If a finite group G acts transitively and nontrivially on a set M, there 
exists o in G that leaves no point of M fixed. 


10.23 Let G be a group and take ae G. For meN, set Gu (a) ={x€G|x” =a} 
and Gm = Gm(1). Is Gm always a subgroup of G? Prove that G2 is a subgroup of 
G if and only if xy = yx for all x, y € Go. If G is abelian, every G,, is a subgroup 
of G. 

Now suppose G finite of order n. Prove: 


(a) If G is cyclic, then Gj» has order (m,n). 
(b) If G is abelian, the order of Gj» is divisible by (m,n), and more generally 


(*) IGm(a)| =0 mod (m,n). 
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(Hint for (b): One can assume m|n and m > 1. Choose a prime factor p of m and 
use induction on ™.) 

Does (*) hold when G is not abelian? In this connection consider the case 
G = 83 andm=3. 

But by a theorem of Frobenius (see B. Huppert, Finite groups 1), |Gm(a)| is 
always divisible by (m,n/c), where c is the number of elements of the conjugacy 
class of a in G. Thus one gets a statement of general validity if one replaces in (*) 
the order 1 of G by the order n/c of the centralizer Zg(a) of a in G. 


Chapter 11: Applications of Galois Theory to Cyclotomic Fields 


11.1 Let a be an algebraic number such that Q(aw):Q@ = 4. Prove that, if the Galois 
group of the minimal polynomials f of a over @ has order greater than 8, then 
a ¢ AQ. (One example is f(X) = ¥*+ X + 1; see §5.7.) 


11.2 What is the smallest angle whose measure in degrees is a natural number and 
that can be constructed with ruler and compass from Q? 


11.3 Let p be a prime and y = g mod p a generator of G := G(Q(é,)/Q) = 
(Z/pZ)*. Show that for any ¢t € N dividing p — 1 there exists a unique subfield 
K = K; of Q(€,) such that Q(¢,):K =¢t and K:Q = (p-—1)/t=:s. If H= H; 
is the corresponding subgroup of G, we have H = iL; Vendy yee Set 


t—1 
(1) ni(t)= lee" for 0<i <s-1. 
j=0 
Then no(t) lies in Ky, and no(t),...,s—1(t) are precisely the s distinct conjugates 


of no(t) over Q. It follows that K; = Q(no(t)) = Q(n; (0). In this way one gets a 
description of all subfields of Q(¢,) via the sums in (1), called Gaussian periods. 


11.4 Consider the situation of §11.3 in the case where p = 17 and ¢ = e!? with 
gy = 22/17. Show that g = 3 can serve as a primitive root mod 17. Then we have, 
for instance, 


no(2) =€+67' =2cosg, 
no(4) =€+¢44+¢-!'4+ 674 = 2(cosy + cos 49), 
no(8)= OR +4 4e7 +O gete et +e. 


Why is 70(8) + 71(8) equal to —1? By computing 79(8)*, or in any other way, 
derive the equality 


X? 4+ X —4 = (X —0(8))(X —m1(8)). 
Because 0(8) = 2(cos 8g + cos 4g + cos 29 + cos g) > 0 we then have 
no(8) = 3(-1+ ¥17), 1m (8) = 3(-1- V17) 
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(incidentally showing again that @(/ 17) is the quadratic subfield of Q(¢17)). Now 
clearly 79 (4) + 72(4) = no(8) and you should show that 79(4)n2(4) = —1, so that 


X? — o(8)X — 1 = (X — no(4))(X — 2(4)): 


similarly, 
X? — 1 (8)X —1 = (X —m1(4))(X — 034). 


It is easy to see that n9(4) > 72(4) and 7; (4) > n3(4). This implies, for example, 
that 


no(4) = $(-—1+ V17+ V34-2V17), (4) = ¢(—1- V17 + V3442V17). 


Finally, show that 
X? —no(4)X +m (4) = (X¥ —no(2))(X — n4(2)) 


and observe that 79(2) > n4(2). Putting it all together we get an explicit construction 
for no(2) = 2cos @ by successive adjunction of square roots, and hence an explicit 
ruler-and-compass construction for the 17-gon. 

11.5 Prove: 


(a) If p is a prime of the form 4k + 3, the number 2p + 1, if also a prime, divides 
2? — 1. Give at least two examples of numbers of the form 2? — 1 (Mersenne 
numbers) that are not prime. 


(b) Any prime divisor p of a Fermat number F;, with k > 2, satisfies p = 1 mod 
Qk+2 
11.6 (a) Is 14993 a quadratic residue modulo 65537? 


(b) Describe all primes p such that 7 is a quadratic residue modulo p. 


11.7 Let a and b be natural numbers, with b odd. Prove: 


(a) If (=) = | for all x EN relatively prime to 5, then b is a square. (Hint: Use 
the Chinese Remainder Theorem). 


(b) There are infinitely many primes p such that ($) = 1. (Compare §9.19.) 


(c) If () = 1 for all x € N odd and relatively prime to a, then a is a square. 
(Hint: Use the quadratic reciprocity law and part (a).) 


(d) Ifa is not a square, there exist infinitely many primes p not dividing a such that 
a is not a quadratic residue modulo p. (Hint: If pi,..., Pn have already been 
found, there exists y € N such that y = p; mod 4a and y= 1 mod p, ... pn.) 


11.8 Prove that the congruence (¥* + 1)(X4—4) = 0 mod p has a solution in Z 
for every prime p, although the corresponding equation has no solution in Z. 
11.9 Let n be an odd natural number. Prove: 


(a) If x"~! = 1 modn for all x € Z relatively prime to n, then n is square-free 
(but not necessarily prime; the smallest counterexample is n = 3-11-17). 
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(b) If (=) = x"—1)/? mod n for all x € Z relatively prime to n, then n is prime 
(this is a converse of sorts to Euler’s criterion). Hint: Taken = pm withm > 1. 
The assumption implies x@"~)/? = (+) mod p. By §11.7 there exists some 
y with (+) = —1. Now take x such that x = y mod m and x = 1 mod p. 


11.10 An algebraic number field is an extension K/Q@ with K:Q < oo. Such a 
field is called quadratic if K:Q = 2. It is called cyclotomic if it is a subfield of 
Q) = Q(fm) for some m. Prove that every quadratic number field is cyclotomic. 
More precisely, suppose K = Q(/d), with d # 1 a square-free integer. Set m = |d| 
if d = 1 mod 4 and m = 4|d| otherwise. Then K is contained in Q@° (and m is 
minimal with this property). 

This is a good place to mention the famous Kronecker-Weber Theorem: Every 
abelian number field K (meaning that K/Q is Galois with a finite abelian Galois 
group) is cyclotomic. (Kronecker did not supply a complete proof; this was done 
by H. Weber.) 


11.11 So that the example in §11.8 does not leave something of a false impression, 
let it be mentioned that a nonconstant polynomial f € Z[X| for which the congruence 
F(X) =0 mod p has a solution in Z for almost all primes p cannot be prime over 
Q. The proof of this fairly deep theorem (see F. Lorenz, Algebraische Zahlentheorie, 
BI-Verlag, 1993, p. 293) is founded on an approach pioneered by Kronecker, which 
in 1896 allowed Frobenius to prove the following stronger result: 

Suppose the Galois group G(f) of a polynomial f € Z[X] of degree n > 1 
contains an element o whose cycle decomposition comprises r cycles, of lengths 
N1,...,Ny (summing up ton). Then there exist infinitely many primes p such that f 
factors in F,[X] into r prime polynomials whose degrees are n,,...,Nr. 

Now, if f is assumed irreducible over Q, the group G(/) contains some o for 
which all the m; exceed 1 (see §10.22); thus there are infinitely many values of p 
for which f has no linear factor in F,[X]. Compare also with Chapter 16, F13. 


11.12 Factor X¥?~! —1 in F,[X] and derive Wilson’s Theorem: 
(2) (p —1)! =-—1 mod p. 


This was already known to Leibniz, as was the following “converse”: If (n — 1)! = 
—1l mod n forn €N, then n is prime. 

For p an odd prime, set ¢ = 3(p-1) and H = {1,2,...,t}. By taking together 
the factors a and p —a, for each a € H, derive from (2) that 


(3) (t!)? = —(-1)?-/? mod p. 


Thus, if p =1 mod 4, the number /! is a solution to the congruence Y? =—1 mod p; 
whereas t! = +1 if p =3 mod 4. In this latter case show more precisely that 


(4) t! = (—1)” mod p, 


where v is the number of elements of H that are not quadratic residues mod p. Let 
jt =t—v be the number of elements of H that are quadratic residues mod p. Prove 
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that uw =v= +(p-l) if p = 1 mod 4, but w ¥ v if p = 3 mod 4. (Remarkably, 
in the latter case v is always less than j2, but this is much harder to prove; see, 
for example, Z. I. Borevich and I. R. Shafarevich, Number theory, Academic Press, 
1966. More precisely, let p = 3 mod 4; if the so-called class group of Q(./—P) 
has order /, then 


_, _ fh if p=3 or p=7 mod 8, 
(5) es 3h otherwise. 


Thus, since 4+ v= $(p-1), we get 


=1(p-1)-1h or v=1(p—-1)—-3h, 


as the case may be. Note that / is always odd.) 


Chapter 12: Further Steps into Galois Theory 


12.1 Let F/K bea separable field extension of finite degree. Derive again, using F3, 
the primitive element theorem (Chapter 8, Theorem 3), which says that F' contains 
an element f such that F = K(f). Can one require that Trr/x(B) = 1? 


12.2 Let ¢, be a primitive root of unity in C. When do the conjugates of ¢, form 
a normal basis for Q(¢,)/Q? (Answer: If and only if 1 is square-free; see also 
§13.7.) 


12.3 In the finite field E = F,3, find: (a) a primitive root of E whose conjugates 
do not form a normal basis of E/F3; (b) a normal basis that does not consist of 
primitive roots of EF. 

For an arbitrary finite field £ with prime field F,, the extension E/F, does 
always have at least one normal basis consisting of primitive roots. This was proved 
by Carlitz for F large enough, and by Davenport for any £. Davenport’s proof is 
elementary, subtle and long (J. London Math. Soc. 43 (1968), 21-39). The statement 
remains true when F, is replaced by an arbitrary subfield K of E. 


12.4 By giving N the divisibility partial order and by taking the natural homo- 
morphisms finn: Z/nZ > Z/mZ, for m dividing n, one makes (Z/nZ)nen into a 
projective system of groups (or rings). Set 


Z =limZ/nZ. 


Prove: 


(a) For every finite field F, there is a canonical isomorphism G(C /F,) Z, where 
C denotes an algebraic closure of F,. 


(b) The open subgroups of Z are precisely the subgroups of the form nZ. The 
map Z > Zz gives rise to an isomorphism Z /nZ~ Zz / nz. There are nontrivial 
closed subgroups of Z other than those of the form nZ. 


Chapter 12: Further Steps into Galois Theory 261 


(c) For each prime p, set Zp = lim Z /p'Z. Show there is a natural isomorphism 
Z|] Zp: f 
DP 


12.5 Let W be the group of all roots of unity in C. Show that the following natural 
isomorphisms hold: 


G(Q(W)/Q) ~ 2” ~ lim(Z/nZ)* = T] ZX. 
n p 


12.6 Using Artin’s result (Theorem 2’) prove again Theorem 4 of Chapter 8. Hint: 
Suppose, in contradiction with equation (8) of that chapter (page 79), that E/K 
contains n linearly independent vectors b1,...,bn, with n > |G|. Now consider the 
system of linear equations 


Sion! (bj) x; =0 foro€G, 


j 
and notice that Trg # 0. 


12.7 (Algebraic independence of field homomorphisms). It is possible to strengthen 
Theorem 2 under the additional assumption that the ground field K is infinite. In 
fact, in this case a polynomial f in C[X1,..., Xn] that satisfies 


() f(01(B),.--.On(B))=9 forall Be E 


must be the zero polynomial. 
To prove this, choose a basis 61,..., Bn of E/K and consider the polynomial 


(2) ghia (oa (7) eee Y on(6)%). 
j=l j=l 


Because of (1) we have g(x1,...,X,) =0 forall x;,...x, € K. Since K is infinite, 
we have g = 0. But then f = 0 as well, for the following reason: By Theorem 2, 
the matrix (o;(8;));,; in M,(C) has an inverse, say (d;s);,5. By substitution in (2) 
we get 


n n 
(Dain keoes- Doane Xe) = f(%,..., XD=f 
k=1 k=1 


12.8 The theorem asserting the existence of normal bases (Theorem 3) appears 
to have first been stated by Emmy Noether, in a 1932 paper in Crelle’s Journal. 
Perhaps because the paper was essentially about number theory, or perhaps also 
because Noether was not sure the theorem was original, she merely outlined the 
proof of this purely algebraic result in a three-line footnote. As the nature of her 
argument makes clear, she was not including the case of a finite ground field. Was 
this all she meant when she remarked in a later work (Gesammelte Abhandlungen, 
p. 638) that her proof had “a gap’? In any case, her argument can be fleshed out 
into a proof of the theorem (for infinite ground fields) as follows: 
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Let E/K be a finite Galois extension with K infinite, and let E(X1,..., Xn) 
be the field of rational functions in the variables X,,..., X, over E. Clearly the 
extension E(X1,..., Xn) /K(X1,...,Xn) is Galois and its Galois group can be 
identified in a natural way with G = G(E/K). Choose a basis 61,..., Bn of E/K, 
form the element 


n 
(3) u=) BX; € E[X1,...,Xn), 
j=l 
and then the ” x n matrix (t~1o(u))z,6 in M,(E[X%,...,Xn]), where t and o run 
over the elements of G in some fixed order. Denote by d = d(X1,..., Xn) the 
determinant of this matrix. We claim that d is not the zero polynomial. Assuming 
this, d cannot vanish everywhere in K”, so there exist a,,...,@, in K such that 


d(a,,...,4n) #0. Now set a := }° a; B;; then 


(4) det (t~!o(a))r,0 £0, 


and this is enough to show that the elements o (a), for o €G, are linearly independent 
over K. 

To show that d £ 0, set Y, := a(u) = )) o (Bj) X; for every o € G; see (3). By 
Theorem 2 the n xn matrix (0 (5;))o,; is invertible. Thus E[X1,..., Xn] equals the 
ring of polynomials over E in the n variables Y,, for o € G. By definition, 

(5) d = det (Y,-1¢) 26: 

Following Frobenius, we call this the group determinant of G. It is nonzero for any 
finite group G and any field E, which can be seen as follows: On every row of 
the matrix M := (Y,-1,)z,¢ the variable Y; appears exactly once, and always on 
the diagonal. Thus, by expanding d = det M according to Leibniz’s rule, we get 
d = Yj' + g, where g € E[Y,|o € G| has degree less than n with respect to Yj. 
Therefore d cannot be 0. 


12.9 Let (o(@))ceg be a normal basis of a finite Galois extension E/K. If F is an 
intermediate field of E/K and we set H = G(E/F), the elements t(a), for t € H, 
may turn out to be linearly dependent over F,, so one does not get a normal basis of 
E/F in this way. Confirm this with the example E/K = Q(f9)/Q, a = 69 + 63, 
F = Q(63). 


Chapter 13: Norm and Trace 


13.1 Let A be a K-algebra and let L be a subalgebra of A contained in the center 
of A (so that A can be viewed as an L-algebra as well). Assume further that both 
L as a K-module and A as an L-module are free and finitely generated. Then the 
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same holds for A as a K-module. Prove that for any a € A we have 


Na/K(@) = Nix (Na/L(@)), 
Tr4/K (a) => Trz/x (Tt4/i(@)), 
Pajx (a; X) = NyxyKtxy(Pa/-(@ X)). 


(Hint: See LA II, p. 181, Aufgabe 63.) From this we can get F5 as a particular 
case, and hence also equations (26) and (27). 


13.2 Let E/K be a Galois extension with cyclic Galois group (a) of order 4. 
Assume also that char K 4 2. Prove that the quadratic intermediate field L of E/K 
must be of the form L = K(Va? +52), witha,be K. 

Hint: Write L = K(/d) and E = L(w), with w? € L. From G(E/L) = 
(o7) deduce that o7(w) = —w and then wo(w) € L. Moreover Ny/K(wo(w)) = 
—Ni/K (w*), so—le Nz/x(L”). The result follows upon observing that if —1 is 
a square in K, every element of K is a sum of two squares (why?). 

Prove also the converse: If L/K is a quadratic extension of the form L = 
K(Va* +57), with a,b € K, there is an extension E/L such that E/K is cyclic 
of degree 4. 


13.3 A field K is called pythagorean if every sum of two squares in K is a square 
in K. Prove that a field K such that char K 4 2 is pythagorean if and only if it has 
no cyclic extension of degree 4. 


13.4 Let E/K be a finite Galois extension with Galois group G. Every a € E” 
gives rise to a map g = gq: G > E™, defined by 


T 


(1) y(t) = —, 


where a’ := t(a) (accordingly, the notation for the group operation of G obeys 
a°* = (a°)*). The function ¢ thus defined clearly satisfies the functional equation 


(2) plot) = 9(c)* y(t). 


Any map g: G > E™* with property (2) is called a crossed homomorphism from G 
to E%*. 


(a) Suppose G is cyclic, with generator 0. Show that if g : G > E™ is a crossed 
homomorphism, g(o) has norm 1. Conversely, if y is an element of £* such that 
Ne/xK(y) = 1, there is a unique crossed homomorphism ¢ taking o to y. 

This equivalence reduces Hilbert’s Theorem 90 (see F9) to a special case of the 
following theorem of A. Speiser, which holds for any finite Galois group: Every 
crossed homomorphism y : G > E™ is split, that is, has the form (1). 

This more general statement, too, is often called Hilbert’s Theorem 90. (There is 
nothing objectionable about that, so long as the common practice of misattributing 
to E. Noether this generalization of the original Theorem 90 is avoided; in this 
connection see F. Lorenz, “Ein Scholion zum Satz 90 von Hilbert”, Abh. Math. 
Univ. Hamburg 68 (1998), 347-362.) 
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(b) Prove the preceding Theorem of A. Speiser—as a candidate for a in (1), start 
with 
a= 2 dgB° whereas, BEE, 


ocG 


and take into account the linear independence of all the o (Chapter 12, Theorem 2). 


(c) Let Ig E* denote the subgroup of E* consisting of all finite products of elements 
of the form a7! := a*/a, with w € EX and t €G. Prove that Nz/x(y) = 1 for 
any y € Ig E*. If G is cyclic with generator 0, we have Ig EX = (E*)°~!. Thus 
Hilbert’s Theorem 90 says that for G cyclic, the norm homomorphism Ng/x : E* > 
K™ has kernel Ig E*. 

For G noncyclic it is possible to have ker Nejx # JG E*. Establishing this 
looks at first like an easy algebraic exercise — take for instance K = Q and EF = 
K(Ja, Vb ) a biquadratic extension. But it’s not as simple as that; see the article 
cited in part (a). Nonetheless, N. Zimmermann (at the time a graduate student 
in Miinster) was able to prove explicitly, by relatively simple number-theoretical 
means, that in E = Q(V2, V3 ), for example, the element y = 1 + /2, which 
obviously satisfies Nex (y) = 1, does not lie in Jg E*. Actually in this case it can 
be shown even that the quotient ker Nz/x /Ig E” has order 2, but this is a deeper 
result; see again the article cited in part (a). 


13.5 Let f and g be distinct normalized irreducible polynomials in the polynomial 
ring K[X] over the field K, of degrees m and n, respectively. In some extension 
E of K, let w and B be elements satisfying f(a~) = 0 and g(6) = 0. Prove the 
following reciprocity result concerning the norm: 


Nxww/K (g(a): Nx (f(B)) | = (HD. 


13.6 A function f:N—C such that f 4 0 is called a number-theoretic function; 
f is called multiplicative if f(mn) = f(m)f(n) for all relatively prime m,n. 
Examples: the Euler totient function g; the function ¢ such that .(m) = 1 for all 
n; the function ¢ such that e(1) = | and e(n) = 0 otherwise; the multiplicative 
function jz defined by z(p) = —1 and p(p’) = 0, for p prime and e > 1. Given 
two number-theoretic functions f, g, define their product f * g by 


(f*g)(n) => f(d)g(n/d). 


d\n 


Prove that fxg=gxf, (fxg)*h= fx(gxh), ex f= fxe= f,andwxi=e. 
Derive from this the Mébius inversion formula: g = f * if and only if f = g * 


13.7 Given n € N, denote by ¢, a primitive n-th root of unity in C. Prove that 


(3) Tra(én)/a(Sn) = Wn), 


where yz is the Mobius function of $13.6. Hint: Define f via f(n) = Trace,)/a (Sn) 
and prove that >> din J (d) = e(n); this reduces the desired equality to the Mébius 
inversion formula. 
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For n > 1, let s2(n) denote the third coefficient of the n-th cyclotomic polyno- 
mial. Prove that 


(4) 259(n) = Tram (bn) — Tram a (Sn) 


and deduce that s2() can only take the values 0, 1,—1. The latter property is shared 
by the fourth coefficient —s3(n), as can be seen from the relation 


(5) 353(n) = Tram /a(Sn) — H(n)® + 34(2) 52(n) 


(compare §15.24). What secrets lurk behind the distribution functions s;(), for 
0 <k < g(n), can only be guessed at. 


Chapter 14: Binomial Equations 


14.1 Let K be a field containing a primitive n-th root of unity. Let E/K be a cyclic 
extension of degree n and suppose E = K(a@), where @ is an n-th root of some 
element a in K*. Take a’ € K*. Prove that E contains an n-th root a’ of a’ if and 
only if there is a natural number r for which a’/a” is an n-th power in K. When 
is K(a’) = E as well? (Hint: See the proof of Fl. Also, these results can also be 
easily derived from Kummer theory.) 


14.2 Let K be a field and g a prime distinct from char K. 


(a) Prove that an element of K~* that is a g-th power in K(¢,) is already a q-th 
power in K (compare F2). 


(b) Let E = K(qa) and E’ = K(q’) be extensions of K containing, respectively, 
q-th roots w and a’ of elements a and a’ in K*. If E/K ~ E’/K, there exists 
a number r relatively prime to g such that a’/a’ is a g-th power in K. If 
E’: K =q the converse also holds. Generalizations? Counterexamples when 
q is not prime? (See also §14.5 and §14.6.) Hint: One may as well assume 
E = E’. Now see §14.1 and part (a). 


14.3 Suppose char K = p > 0. Let E = K(q) and E’ = K(q’) be extensions of K, 
where a@, a’ satisfy so(a) = a and g(a’) = a’, with a,a’ € K (for go see Remark 1 
after Theorem 3). Prove that E/K and E'/K are isomorphic if and only if there is 
a natural number r < p such that a’—ra€ (K). 

More generally, formulate a Kummer theory, analogous to Theorem 4, for 
abelian extensions of exponent p; in this version the multiplicative group of K 
is replaced by the additive group of K and the n-th power map is replaced by go. 
This topic is explored further in §14.15. 


14.4 Let K be a field of characteristic p > 0. 


(a) Let E/K be a cyclic extension of degree p® 4 1, with o a generator of G = 
G(E/K). Set m = p*! and prove: There exists a unique intermediate field 
L of E/K such that E: L = p; this field is of the form £ = L(a), where a 
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satisfies so0(~) = a for some a € L; moreover oa = a+ 1. Prove that, as a 
consequence, E = K(q@). Prove also that 6B := oa —«a lies in L and satisfies 
oa—a= @(f) and Sz /x(B) = 1. Is it the case that L = K(f)? 

Let L/K be a cyclic extension of degree p°-! 4 1. Prove there exists an 
extension E of L such that E/K is cyclic of degree p®. (Hint: Use part (a) 
as a road map.) 


(b 


wm 


14.5 Let n be a natural number not divisible by 4. 


(a) Prove that if ¥” —a € Q[X] is irreducible, the Galois group G of X” —a over 
Q has order g(”)n or g(n)n/2. Do both cases really occur? (Hint: Investigate 
whether Y” —a is irreducible over Q(€,).) 

(b) Suppose we’re in the case |G| = g(n)n of part (a). Prove that G is isomorphic 
to the subgroup of matrices in GL(2, Z/nZ) having the form (6 an 


Nar 


14.6 Let g be a prime number and K a field of characteristic distinct from q; assume 
also that /—I € K if q = 2. Prove that any element of K that is a g”-th power in 
K(&q7) is already a q”-th power in K. (Hint: Use induction on n, and be aware that 
some perseverance is required; apply Theorem 2 and the results from §9.11, $10.2 
and §14.2.) 


14.7 Let E/K be a finite field extension and M a subgroup of E* such that 
K* CM. Assume the quotient M/K™~ is finite. Explain why 


[K(M): K]< M:K* 


necessarily. Prove that K(/)/K is separable if and only if the order of M/K™ is 
not divisible by char K. 


14.8 In the situation of §14.7, assume that E/K is separable and that M satisfies 
the following conditions: (i) For any prime p, any p-th root of unity contained in 
M is already in K. (ii) If M contains a 1+ €4, where 4 is a primitive fourth root 
of unity, then ¢4 € K. Prove in order: 
(a) If //K™* has prime order p, then [K(M): K]= p. 
(b) If M/K~ is a p-group and [K(M): K] = p, then M: K* = p. (Hint: See 
Theorem 2 and §14.2.) 
(c) If M/K™ is a group of order p”, then [K(M): K] = M: K™%. (Hint: Induction 
on the index [K(M): K]; from M = (a) M’ with M': KX = p""! deduce that 
X? —qQ? is irreducible over K(M’).) 
(d) [K(M): K] = M: K%. (Hint: Consider each Sylow group P/K* of M/K” 
and keep $14.7 in mind.) 


14.9 Let E/K be a field extension and a,,...,a@, elements of E. Suppose that 
a =a; € K*, where ,,...,m, are natural numbers relatively prime to the char- 
acteristic of K. Prove: 

(a) If K(aj,...,a@-):K =nyn2...n;, the following conditions are satisfied: (i) 


For any prime p, a product [| ‘aii aii cannot be a p-th power in K unless each 
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t; is a multiple of p. (i) A product Tain; aii , with n;t; = 0 mod 4, cannot be 
of the form —4A4 with A € K unless all the ¢; are multiples of 4. 

(b) If conditions (i) and (ii) in part (a) are satisfied and M is the subgroup of E” 
generated by a,,...,a@, and K™, then M/K™ has order nyn2...n,. 

(c) If conditions (i) and (ii) in part (a) are satisfied, then K(a@,,...,a@,):K = 
NNz...n,. (Hint: By part (b), M/K™ has order nyn2...n,. Now check that 
M also satisfies conditions (1) and (ii) of §14.8.) 


Apart from the assumptions made on the characteristic, these results are a direct 
generalization of Capelli’s Theorem (Theorem 2 in Section 14.1). 


14.10 Let G be any abelian group of finite order. Prove that G can be realized 
as a Galois group over the field @ (see the remarks after F5 in Section 15.3). 
Hint: Handle first the case where G is cyclic, by considering Q(¢,) for primes 
Pp = 1 mod |G|; see §9.18. 


14.11 Let p be any prime number. Prove that the Galois group of the polynomial 
X4? 4 4p? over @ has order v(4p) p. (Hint: Prove that —4p* is a fourth power in 


Q(C4p).) 


14.12 Let E/K be a cyclic extension of degree n, where K contains a primitive 
n-th root of unity ¢. Let o be a generator of G(E/K). The statement and proof 
of Theorem | (page 144) boil down to the existence of a nonzero a € EF such that 
oa = Ca. Justify the existence of such an @ using linear algebra. 

Hint: Consider o as an endomorphism of the K-vector space FE’, with minimal 
polynomial f. Since o” = 1, this polynomial divides X” — 1. Moreover f splits 
into linear factors, because ¢ € K, and it has only simple roots. Therefore o is 
diagonalizable (see LA I, p. 215). Now let W be the set of eigenvalues of o in K. 
Why is W a group? 

The goal is to show that ¢ € W. Suppose otherwise. Then k := ord W <n. 
Applying o* to a basis of eigenvectors of o we get o* = 1, a contradiction. 
(Note: Using the linear independence of 1, 0, o*,...,0"—!, we see right away from 
Theorem 2 in Chapter 12 that f(X) = X”—1, so ¢ is an eigenvalue of o.) 


14.13 Let m be a natural number greater than 2. Prove that 
Q(fm, 7/2): = 22""-2 and O(fm)NQ(7V2) = Q(V2). 


Hint: Setn = 2", a = Y2, K = Q(&;). To determine K(w):K use FI after 
observing that /2 € K, /2¢ K. 


14.14 Let p be a prime number, K a field of characteristic distinct from p, ¢ 
a primitive p-th root of unity (in an algebraic closure C of K), L = K(f), y a 
generator of G(L/K) and k a natural number such that €” := y(¢) = ¢*. Prove: 


(a) If E/L is a cyclic extension of degree p, the extension E'/K is abelian if and 
only if E = L( ¥/a) for some a € L™ such that a’~* € L*?. 


(b) There is a bijection between abelian extensions F'/K of exponent p (in C/K) 
and subgroups A of L* such that L*? C A and AYE SC L*?, 
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14.15 Let K be a field of prime characteristic p, C an algebraic closure of K 
and go: C — C the map given by (x) = x? — x. For each subset B of K, 
denote by Ez = K(#~!(B)) the subfield of C obtained by adjoining to K the roots 
of all polynomials X¥? — X — b such that b € B. Prove that the map Btw Eg 
provides a one-to-one correspondence between additive subgroups B of K such that 
§(K) © B and abelian extensions E/K of exponent p (in C/K); moreover Ep/K is 
finite if and only if K/§(K) is finite, and in that case there are natural isomorphisms 
G = (B/g(K))* and G* ~ B/g(K), where G = G(Eg/K). 

This is a generalization of the Artin—Schreier Theorem (Theorem 3), so it’s 
sometimes called Artin—Schreier theory. It represents an analog of Kummer theory; 
not a full one, however, since it does not encompass abelian extensions of exponent 
p” for n > 1. There is a theory for the latter as well, the key to which stems from 
the calculus of Witt vectors; see Volume II, Chapter 26, Theorem 6. 


Chapter 15: Solvability of Equations 


15.1 Let G be a solvable, characteristic-simple finite group (see §10.10). Prove that 
G is elementary abelian, that is, isomorphic to a product G ~ Z/pZ x --- x Z/pZ 
for p prime. In particular, every minimal normal subgroup of a solvable group is 
elementary abelian. 


15.2 Let G be a finite group. A chain G = Hp D> Hi >--- D> AH, = 1 of subgroups 
Hi; of G such that H; < Hj~, is called a normal series of G. The groups Hj—;/H; 
are called the factors of the normal series. If the Hj are all normal in G, the 
chain is called a principal series of G. The subgroup G’ of G generated by all the 
commutators (elements of the form aba~!b7! for a, b € G) is called the commutator 
subgroup of G. Define G® = G and G¢+) = G™’ by recursion. Prove: 


(a) G’ is the smallest normal subgroup of G with an abelian quotient group. 


(b) There is equivalence between: (i) G is solvable; (ii) G has a normal series 
whose factors are abelian. (iii) There exists n such thatG@ = 1. (iv) G 
has a principal series whose factors are abelian. (v) G has a principal series 
whose factors are elementary abelian ($15.1). 


15.3 Let K be a field of characteristic 0. Prove that for every n the Galois group 
of a polynomial of the form X4” + aX3" + bX?" +cX" +d € K[X] is solvable. 


15.4 Suppose a cubic polynomial f € Q[X] is irreducible and has three real roots. 
Prove that nevertheless there is no radical extension F/@ such that F C R and that 
F contains the splitting field of f over Q. 
15.5 Prove: 

(a) Every group of order 1 < 60 is solvable (use §10.11). 


(b) As is a simple group of order 60. (Hint: Otherwise As would be solvable, 
because of part (a); but this contradicts F6.) 
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15.6 Prove: 

(a) If a simple group G has a strict subgroup H of index n < 5 and G is not cyclic, 
then n = 5 and G ~ As. (Hint: G acts on G/H, and is therefore isomorphic 
to a subgroup G* of S,; since sgn must be trivial on G*, we have G* C Ay.) 

(b) Every simple group G of order 60 is isomorphic to As. Hint: Otherwise, by 
part (a), G has no strict subgroup H such that G: H < 5. Hence there must 
be fifteen 2-Sylow groups, ten 3-Sylow groups and six 5-Sylow groups. This 
means that there exists some x # | lying in two different 2-Sylow groups P 
and P’; since both are abelian, x lies in the center of H = (P, P’) =G,a 
contradiction. (Only the count of 2-Sylow groups is needed for the argument 
if one uses Remark (b’) following Sylow’s Third Theorem, page 100.) 


15.7 Prove: For every maximal subgroup H of a solvable group G, the index n = 
G: H is aprime power. Hint: By induction one can restrict oneself to the case where 
H contains no nontrivial normal subgroups of G. Moreover, under the action of G 
on G/H, every normal subgroup N # 1 acts transitively on G/H. Now consider a 
minimal normal subgroup N of G and use §15.1. 


15.8 Let a group G act ona set M with n elements. Prove that there exists a unique 
homomorphism 

sgn = sgng.wy:G—>{+1,—-]} 
with the following property: If 9: {1,2,...,”}—> M is any bijection and we denote 
by 0? =~ !oa og the element of S,, corresponding to o € G via ¢, then 


sgn(o) =sgn,(o%) forall o EG, 
where sgn, : Sy > {+1,—1} is the signature map. 


15.9 Let f € K[X] be a separable polynomial over a field K and let E be a splitting 
field of f over K. The Galois group G of E'/K acts on the set M = {a1,..., an} 
of roots of f in E; let sgn: G > {+1, —1} be the corresponding signature map (see 
§15.8). Prove that the element A = |]; ;(a@; — a,j) satisfies o(A) = sgn(o)A for 
ala eG. 

Let H = {o € G | sgn(o) = 1} be the subgroup of even permutations o of G. 
Assuming char K # 2, the fixed field of H in E is K(/D(/)); in particular, 
D(f) is a square in K if and only if G contains only even permutations. (For the 
definition of D(f) see Chapter 8, Definition 4; one may as well assume also that 
f is normalized.) 


i<j 


15.10 Consider the natural action of S;, on the rational function field k(X1,..., Xn), 
whose fixed field is K = k(s1,...,5n) (see F3). Set A = [i<j (Xi — Xj) 
and show, under the assumption chark # 2, that K(A) is the fixed field of 
the alternating group A,. As we saw in Section 15.3, the intermediate field 
k(s1,...,8n) of k(X1,..., Xn) is a field of rational functions in n variables over 
k. Jt is natural to ask whether this is the case also for the intermediate field 
K(A)=k(s1,...,5n, A), that is, whether there are in this field variables ty,...,ty 
over k such that k(s1,...,5n, A) = k(t),...,f). The answer to this question is 
unknown. 
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15.11 Why does Aq have no subgroup of order 6? Why are the solvable groups 
S3, A4 and S, not nilpotent? 


15.12 Prove: 


(a) The conjugacy class of a permutation o € S), of type c1,..., Cy has cardinal- 
ity m!/(c1!1% c2!2 ...cen!n™); equivalently, exactly c1!1°! co!2@ ...c,!n™ 
elements of S, commute with o. 

(b) An element o € S;, of type c1,..., Cn belongs to A, if and only if the number 
yj (j — l)c; is even. Let k(o) be the conjugacy class of o in S,. Given 
o € An, either k(c) is also a conjugacy class in Ay or the disjoint union of 
two conjugacy classes in A,, and the choice hinges on whether there exists an 
odd permutation that commutes with o. The former case happens if and only 
if c; = 1 for some even j or cj = 2 for some odd j. 


15.13 Using §15.12, show that As is simple. (Hint: Any normal subgroup N of As 
consists of full conjugacy classes in As. But the equation 1+20x+15y+12z=|N| 
has no solution in integers x, y,z > 0 when || is a proper divisor of 60.) 


15.14 Given a finite group G, consider the action of G on itself by left translations. 
Prove that the corresponding signature map is nontrivial if and only if G has even 
order and the 2-Sylow groups of G are cyclic. 


15.15 Let n be an odd natural number. Prove: 


(a) Any group G of order 27 contains a normal subgroup of index 2. (Hint: There 
exists tT € G of order 2. Consider the cycle decomposition of t as a permutation 
of G, and look at the image of t under sgn: G > {+1,-—1}.) 


(b) If G is a group of order 2*n containing an element of order 2“, then G has a 
normal subgroup N with cyclic quotient group of order 2k | (Hint: Argue by 
induction on k.) 


15.16 Let G be a group acting on a set M. We say that the action is k-transitive if 
any k-tuple of distinct elements of M is mapped to any other such k-tuple by some 
o€G. 


(a) Let G be a subgroup of the full permutation group S(/) of an n-element set 
M, acting 2-transitively on M. Prove that every normal subgroup N # 1 of 
G acts transitively on M. 


(b) Prove that for n > 3 the action of A, on {1,2,...,m} is (n—2)-transitive. 


(c) Using induction on n, prove that A, is simple if n => 5. Hint: To start the 
induction apply §15.13 or $15.5. Now let > 5 and regard A,_, as a subgroup 
of A,. Suppose 1 4 N <1 Aj. Since N acts transitively, we have Ay, = NAy-1. 
By the induction assumption, either Ay—; 1 N = An—1 or Ap-1 ON = 1. In 
the second case, take some a’ € N such that o’(n) = 1. Prove that there exists 
o € Ay_; such that oo’o~! £0’ but oo’! (n) = 1. This contradicts the fact 
that 4,1 N =1. 


15.17 For n = 3, determine all nontrivial normal subgroups of S,;,. (Answer: For 
n #4 the only one is A, and for n = 4 there is also the four-group V4; see §15.16.) 
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15.18 Derive from §15.17 Bertrand’s Theorem: For n # 4 the group Sy, has no 
subgroup H of index m, where 2 <m <n. (Hint: S, acts transitively on G/H; 
now look at S, > S(G/#H).) As an interesting historical footnote, when Bertrand 
proved his theorem in 1845 he had to make the assumption that for every natural 
number n > | there exists a prime p such that n < p < 2n; this became known as 
Bertrand’s Postulate and was later proved by Chebyshev. 

As to subgroups of index 7 in Sy, it can be proved (with more effort) that — 
assuming n # 6—the only ones are those that leave invariant a particular element 
of the set {1,...,}; otherwise put, every automorphism of S, is inner. The case 
n = 6 is exceptional: then there is also another type. 


15.19 Let p = 5 be a prime number. (a) Prove that there exists no € N, depending 
on p, such that for any natural number n > no the polynomial 


f(X) = (X—p)(X—-2p)...(X —(p—2) p)(X* +np) — p 


over @ is not solvable by radicals. (b) Same question, proving that f in fact has 
Galois group isomorphic to Sp. (Hint: For part (a) see F12, and for part (b) the 
remarks thereafter.) 


15.20 Let f € K[X] be irreducible of prime degree p. Suppose the splitting field 
L of f over K contains roots a 4 6 of f with B € K[a]. Prove that L/K is cyclic 
of degree p. (Hint: Using F1(b) in Chapter 6, prove that K(a) /K is Galois.) 


15.21 Let the subgroup G of S, act transitively on {1,2,...,n}. Prove that, if 
G contains a cycle of length m — 1 and also a transposition, then G = S,. (Hint: 
If (12...n—1) € G, show that G contains all transpositions of the form (i 7).) 


15.22 (a) Find the Galois group of the polynomial f/(X) = X°> +X +1 over Q. 
(b) Let a,,0,a@3 be the roots of ¥? — ¥2 +1 in C. Prove that the splitting field 
of f has exactly the following distinct nontrivial subfields: Q(/—3), Q(/—23), 
Q(V69), Q(/—3, V69), Q(a1,a2,03), A(ax), Q(ax, V—3), Q(az, V69), with 


1 <k <3 in the last three cases. 
15.23 Let K be a subfield of R and let f € K[X] be a solvable prime polynomial 


of prime degree p. Prove that D(f) > 0 if p = 1 mod 4. For p = 3 mod 4 the 
inequality D(f) <0 is equivalent to f having a single real root. 


15.24 In the polynomial ring Z[X1,..., Xn], consider for i > 0 the polynomials 
pi= Xi + D8: ++++-+ X7 (sums of powers) and prove Newton’s formulas: 


Pr — $1 Pr-1 + $2 Pr—2 —+** +(-1)"SnPr-n = 0 for n <r, 
Pr —81 Pr—-1 + $2 Pr—2 — +++ + (F1)" Spr =0 forn>r=>1. 


Hint: With f as in F3, set g = f* = X" f(X—!) = )\(-1)'5; X". Then the quotient 
Xg'(X)/g(X) can be expanded in power series as n—) “7-9 py X*. The assertion 
follows by comparing coefficients for Xg’(X). 

Incidentally, log f*(X) has the power series expansion — )°7—_,(1/k) pp X e 
from which it likewise follows that each of the px’s is a polynomial in s1,..., 5% 
with integer coefficients (where s, = 0 for k > n). 
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15.25 Let G be the Galois group of the polynomial f(Y) = ¥*— X —1 over Q 
(see page 189). Use Theorem 7 to prove that G = Ss. 

Hint: By §16.10 below the discriminant D of f equals 2869 = 19-151. Therefore 
G is not contained in As, by §15.9. Suppose G # Ss. Then GN As is a Strict 
subgroup of As, and therefore solvable (§15.5). But then so is G. Theorem 7 and 
its proof then imply that G = NH, with N <G and H <G, where N is generated 
by a cycle of length 5 and H is isomorphic to a subgroup of FZ. Since G is not 
contained in As, there must exist t € H such that sgnt = —1. Since G 4 Ss, this 
T cannot be a transposition. Deduce that t is a cycle of length 4. Letting F' be the 
fixed field of N, the extension F/Q is then cyclic of degree 4, with /D €¢ F. Now 
§13.2 leads to a contradiction, because D = 19-151 is not of the form a? + b? with 
a,b € Q. (To see this apply §9.3). 


Chapter 16: Integral Ring Extensions 
with Applications to Galois Theory 


16.1 Let A/R be an integral ring extension. Prove that, if A is an integral domain 
and R is a field, A is a field. 


16.2 Let A/R be an integral ring extension. Let $8 be a prime ideal of A and p a 
prime ideal of R such that 8 lies over p, meaning that 3M R = p. Prove: 


(a) If p is a maximal ideal in R, then $8 is a maximal ideal in A. (Hint: Go over 
to A/B.) 

(b) If Bo is a second prime ideal of A lying over p and it satisfies Bo C , then 
3B = Bo. (Hint: By going over to A/3Bo one can assume that A is an integral 
domain and 3M R = 0.) 


16.3 Let A/R be an extension of commutative rings and let S be a multiplicative 
subset of R (hence also of A); see §3.12. The ring S~'R can be regarded as a 
subring of S~!A. Prove that if A/R is integral, the ring extension S~'A/S7!R is 
also integral. 


16.4 Let A/R be an integral ring extension, q an ideal of R and qA the ideal of A 
generated by q. Prove that every a € qA satisfies an integrality equation f(a) = 0 
whose coefficients, apart from that of the leading term, lie in q. (Hint: Suppose 
O = aj, + +++ +ayQy, with a; € q and a; € A. The subring A’ = Riay,..., an] 
is finitely generated as a module over R (see F3), and aA’ C qA’. Now adapt the 
proof of F1 to derive the assertion.) 


16.5 In the situation of §16.4, assume further that A is an integral domain, R is 
integrally closed in its fraction field K and q is a prime ideal. Prove that if a € qA, 
the minimal polynomial g of a over K has all its coefficients in q, apart from that 
of the leading term. (Hint: Take f as in §16.4 and consider f(X) = g(X)A(X) 
modulo q.) 


Chapter 16: Integral Ring Extensions with Applications to Galois Theory 273 


16.6 Let B/R be an extension of commutative rings and q a prime ideal of R. If 
qBO R =q, there exists a prime ideal Q of B such QM R = q (and vice versa). 

Hint: Set S = R~q; then gBN S = @. Thus qS~'B is a proper ideal of S~'B 
(see §4.12) and thus lies in a maximal ideal 0 of S~'B. Now set Q = MN B, in 
the sense of §4.12. 


16.7 The fundamental theorem on symmetric functions in Chapter 15 was only given 
for the case of polynomials over a field, since that was what the context demanded. 
Now, with the methods of Chapter 16, prove: 

Let R be a commutative ring with unity. Then every symmetric polynomial h in 


R[X1,..., Xn] has a unique representation h = g(s1,...,8n) with some polynomial 
g € R[M,..., Xn]. Hence the ring of symmetric polynomials of R[X,,..., Xn] 
coincides with the subring R[s1,...,8n] of R[X1,..., Xn], and it is even a polyno- 
mial ring in S1,..., Sn as variables over R. 


Hint: Since the question is one of existence and uniqueness involving finitely 
many coefficients only, it suffices to prove the theorem for the case of a polynomial 
ring R= Z[Y,..., Y;] over Z. Let k = Frac R. From Chapter 15, F4 it follows 
that there exists a unique g in k(X;,..., X;,) such that A = g(sj,...,8,). There 
remains to show that g in fact lies in R[X1,..., X,]. This follows from the fact 
that h(X),..., X») is integral over R[s1,..., 5,] and the latter ring is a UFD, hence 
integrally closed. 


16.8 Derive from §16.7 the existence of a certain n-variable polynomial d, over 
Z with the following property: If f = ¥” + a,;X"~! +-+-+ ay is any normalized 
polynomial of degree n over any field K, the discriminant of f is given by D(f) = 
dn(d1,..., An). 
16.9 In the situation of Section 16.3, prove: 

(i) D(f) €Z (or D(f) € R as the case may be). 

(ii) f has no multiple roots if and only if D(f) #0 mod p (or D(f) 40 mod p). 


16.10 (a) Let E/K be a field extension of degree n and a@ a primitive element of 
E/K, with minimal polynomial /. Prove that 


D(f) = (-I)""P? Nex (f'(@)). 


(b) Let K be a field. Prove that a polynomial f € K[X] of the form f(X) = 
X" + bX +c has discriminant 


D(f) = (HI? (ne! + (HI! — 1) 18") 


Hint: By §16.8, you can assume that f is irreducible and that char K = 0. 
Suppose f(a) = 0; set B = f’(q@) and show that a can be expressed in terms 
of 6 by means of a simple formula. Now compute the minimal polynomial of 
Bf and so also Nxq@)/K(B). 


16.11 Prove that the two statements in F12* are indeed equivalent. 
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16.12 Let A/R be an extension of commutative rings and let p be a prime ideal 
of R. Prove that, if A/R is integral, there exists a prime ideal 8 of A such that 
BOR=p. (This going up theorem goes back to Krull, but is often named after 
Cohen and Seidenberg). 

Hint: Consider first the case where R is a local ring (see §4.13) and p its maximal 
ideal. Then it’s actually the case that every maximal ideal $B of A satisfies BN .R=p, 
as can be easily seen by applying F11 to the rings 4/58 and R/BN R. The general 
case can be reduced to the local case by examining the ring extension S~!A/S~'R, 
where S = R~ p; see §4.13, §4.12 and §16.3. 


16.13 Let E/K be a finite separable field extension of degree n, and let 01,..., 0% 
be the distinct K-homomorphisms from F into the algebraic closure C of E. Fix a 
basis 61,..., 8, of E/K and consider the matrix 


(1) M = (6;(B)))i,;- 


Its determinant A is nonzero (Chapter 12, Theorem 2). The square 


Dax (Bisceic ha Ha 


is also nonzero. But the square must lie K, as can be seen for instance from the 
equality A? = det ‘M det M and its consequence 


(2) Deg/K(B1,---+Bn) = det((Sz/x (6: 8;))i,3)- 


The element Dz/x(B1,..., Bn) € K is called the discriminant of the basis B,,..., Bn 
of E/K. If E = K(a) and f = MiPox(q), the discriminant of the basis 1,a,..., 


a”—! agrees with the discriminant of the polynomial f: 


(3) Dex (1,0,...,a" 1) = D(f) = 1)" P? Nejx (f'(@)); 
see (11) in Chapter 12 and (17) in Chapter 8. 


Now assume that K is the fraction field of an integrally closed domain R, and 
let A be the integral closure of R in E’. Suppose a basis 61,..., Bn of E/K consists 
only of elements of A; such a basis exists by F7. Prove that 


1 
(4) AC p (RB +++ + RBn), 
with D = De/x(P1,...,Bn) € R. If we now choose a € A with FE = K(q), then 
(5) AC RI ] 
S 5 Rlel- 


Hint: Any f € A has first of all a representation B = )~ ; xj Bj, with each x; in K. 
Applying o; one gets 


o;(B) = > xy0:(8)) for 1<i<n. 
Jj 


The assertion follows using Cramér’s Rule (see for example LA II, p. 12). 
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16.14 Let E = Q(V/d), where d € Z is square-free and distinct from 1. Find the 
integral closure A of Z in E. Hint: An element a € E is an algebraic integer if 
and only if its trace an norm are integers. Use this to show that A = Z[Vd ] for 
d =2 mod 4 and for d = 3 mod 4, whereas A = VARA + Vd)| for d = 1 mod 4. 


16.15 Let K be an algebraic number field, that is, an extension of Q such that 
K:Q <oo. Denote by Ox the integral closure of Z in K. Prove that Ox possesses 
a Z-basis with n = K:Q. Such a basis is called an integral basis of K. (Hint: 
Using equation (4) in §16.13 above, the assertion follows easily from Section 14.3; 
see the proof of Theorem 6 there.) 

Prove further that all integral bases of K have the same discriminant; the integer 
Dx thus defined is called the discriminant of the algebraic number field K. Find 
the discriminant of the quadratic number field K = Q(V/d) (see §16.14). 


16.16 Let E/K be a extension of algebraic number fields, of degree d, and suppose 
that E = K(fm), where ¢ is a primitive m-th root of unity. Prove that 


mOg C Ox[Em). 


Hint: f = MiPog (fm) divides X"™ — 1. Therefore f’(¢) is a divisor of m in Og. 
Now use equations (3) and (5) from $16.13. 


16.17 (a) Let p” be a prime power and ¢ = €pr a primitive p’”-th root of unity in C. 
Prove that the integral closure of Z in the field F = Q(¢) is Of = Z[€]. 
Hint: Set 7 := 1—-—€. In the ring Z[¢] = Z[z] we have (p) = (2°) with 
e = F:Q; see §9.7. Now use this to show that 


por NZ[o] = pZ{é}. 
At the same time, by §16.16, there is a power p® such that p*Or C Z[C]. 


(b) Now let 7 be any natural number and ¢, a primitive n-th root of unity. Prove 
that Z[,] is the ring of algebraic integers of Q(é,). Hint: Use induction 
founded on §16.16(a). 


16.18 Let 7 be a natural number and c > 0 a real number. Prove the following simple 
but remarkable observation of Kronecker: There are only finitely many algebraic 
integers a whose conjugates have absolute value at most c and such that Q(a) :Q 
is at most n. Hence a given algebraic number field K (see §16.15) contains only a 
finite number of algebraic integers whose conjugates all have absolute value at most 
c. (Hint: The coefficients of the minimal polynomial of an algebraic number @ can 
be expressed in terms of the conjugates of a.) 


16.19 Deduce from §16.18 that any algebraic integer a whose conjugates all have 
absolute value 1 must be a root of unity. Is this also true when a@ is only assumed 
to be algebraic? 


16.20 Let K be a subfield of C and let Ko = KNR be its maximal real subfield. We 
say that K is a complex multiplication field, or CM-field in short, if two conditions 
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are satisfied: (i) K: Ko = 2, and (ii) Ko is totally real, that is, every homomorphism 
o:Ko—C maps Ko into R. An important example of a CM-field is K = Q(¢,). 
Assume that K is a CM-field and prove: 
(a) If 0: C > C is complex conjugacy and o : K — C is any homomorphism, the 
composition o~!po is defined and coincides with p on K. 
(b) If @ € K has absolute value 1, so do all conjugates of a. 
(c) If ¢ is a unit of K, that is, a unit in the ring Ox of algebraic integers in K, 
then ¢/é is a root of unity in K (see §16.19). 


(d) The map d : a > a/@ gives rise to an exact sequence 


d 
1 > Wr Ex, > Ex > W/W. 


where Wx is the group of roots of unity of K and Ex, Ex, are the groups of 
units of K, Ko. In particular we have Ex: Wr Ex, < 2. 


16.21 (a) Prove that K = Q(Vv2, V1i+ i) is not a CM-field (see §16.20). Find the 
maximal real subfield Ko of K. 
(b) Find an algebraic integer of absolute value 1 whose conjugates don’t all have 
absolute value | (but compare §16.20(b)). Hint: Consider K(V2) with K as 
in (a). 
(c) Let @ be an algebraic number such that @ 4 a and a@ € Q. Prove that if all 
conjugates of a have the same absolute value, then Q(a) is a CM-field. 


16.22 (a) Let p” be an odd prime power and ¢ = ¢,r a primitive p”-th root of unity 
in C. Prove Kummer’s Lemma: Every unit ¢ in Z[¢] has the form ¢ = &n for 
some p’-th root of unity € and some real unit 7. 

(b) Does the same hold in the case p = 2? 

(c) Let n be a natural number and ¢, a primitive n-th root of unity. Assume also 
that nm 4 2 mod 4. Set E = Z[¢]*, Eo = ENR and W = (—,). Prove that 
E #4 WE) if n is not a prime power. (Hint: See §16.19 and §16.20, and take 
§9.8(vi) into account.) 


16.23 (a) Take K = Q(a), where a? +a+1=0. Justify why Dx = d(1,a,«7) 
and hence why 1, a, a? is an integral basis of K. 
(b) Take K = Q(a@), where a> —a@ + 1 =0. Show that Ox = Z[a’]. 
(c) Take K = Q(a), where a? + a? —2a + 8 = 0. Prove that Ox 4 Z[a]. (Hint: 
Consider 4/a.) 


16.24 Let K = Q(¢,), where ¢, is a primitive n-th root of unity, with n > 2. Prove 


that 
nem) 


II per/Pp-1 : 
pin 


De = Cee? 


Hint: Consider first the case n = p’; then the n-th cyclotomic polynomial f = F, 
satisfies X?" —1 = (xe —1) f(X). Taking the derivative, plugging in ¢, and 


Chapter 18: Fundamentals of Transcendental Field Extensions 277 


taking the norm then leads to the desired assertion. For other values of n work by 
induction; use the fact that if A is an r x r matrix and B is an s x s matrix, the 
tensor product matrix A ® B = (a;; B) has determinant det(A)* det(B)” (see LA II, 
p. 181, problem 63). 


16.25 Let m,n be odd integers, with n not divisible by 5. Prove that the Galois 
group of the polynomial 
f(X) = X°4+5mX +5n 


over Q is isomorphic to S5. (Hint: Use Theorem | with p = 2.) 
16.26 Prove that all the roots of the polynomial 


F(X) =X = 20X74 OF 41 


are real, and that the Galois group of f over @ is isomorphic to Ss. (Hint: Use 
Theorem 3 of Chapter 14 with p = 5 and F13 of Chapter 16 with p = 2.) 


16.27 Prove the irreducibility of the polynomial 
f(X) = X°-5X +12 


over @, convince yourself that f has only one real root and that D(f) is a square, 
and deduce therefrom that the Galois group G of f over @ must be isomorphic 
either to As or to a subgroup of order 10 of As (see Section 15.6). (Note: It turns 
out that G is not isomorphic to As, but this is apparently not to be proved without 
a good deal more trouble.) Prove that 


f(X) = X° + 20X — 16 


has Galois group over @ isomorphic to As. Hint: Again infer from §16.10 that 
D(f) is a square. Use F13 with p = 3 and p =7 to show that f is irreducible and 
G(/) contains a 3-cycle. 


Chapter 18: Fundamentals of Transcendental Field Extensions 


18.1 Let E/K be a field extension. Prove the equivalence of: (i) FE: K(B) is finite 
for any transcendence basis B of E/K. (ii) TrDeg(E/K) < ov, and E: K(B) is 
finite for any transcendence basis B of E/K. (iii) There exists a finite transcendence 
basis B of E/K such that EF: K(B) is finite. (iv) £/K is finitely generated, that 
is, E = K(x1,...,Xn) for some x1,...,X, in E. 


18.2 Let E/K be a field extension. Prove: 


(a) If M C E is algebraically independent over K and F is an intermediate field 
of E/K such that F'/K is algebraic, then M is also algebraically independent 
over F’. 
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(b) If £/K is purely transcendental, K is algebraically closed in E. 

(c) Let B be a transcendence basis of E/K and M a subset of FE such that 
E/K(M) is algebraic. Then B can be written as a union B = Ley Ba. 
where the B, are finite sets. Thus for B infinite the cardinality of B does not 
exceed that of M. 

18.3 Prove: 
(i) TrDeg(R/Q) = TrDeg(C/Q) = Card(R). 
(ii) Aut(C) = Aut(C/Q) is uncountable (contrast with §8.20). 


(iii) If C is the algebraic closure of Q in C and B is a transcendence basis of C/C, 
then C/C(B) is never finite and hence C/C is not purely transcendental. 
(Hint: Consider VB ) 


(iv) There exist subfields K of C such that C:K =2 and K £R. 

18.4 In the polynomial ring K[X,,...,X;,] over the field K, consider the power 
sum polynomials p;,..., Pn defined in §15.24. Prove that if K has characteristic 
ZeYO, P1,.--, Pn are algebraically independent over K. Hint: $15.24 implies that 
K[pi,.--, Pn] = Kls1,.-., 5]. 


18.5 Let k be a field of characteristic other than 2. For any a,b € k™*, consider the 
polynomial 


(1) f(X, Y) =aX? + bY? -1eEk[X, Y] 


and the corresponding homogeneous polynomial oF =aX?+bY*—Z? ink[X, Y, Z]. 
Let C be an algebraic closure of k(X) and y an element in C such that f(X, y) =0. 
Set x := X and F := k(x, y). Prove: 


(a) f is absolutely irreducible, that is, irreducible over the algebraic closure Co 


of k (in C). 
(b) TrDeg(F/k) =1, F: k(x) =2, and k is algebraically closed in F’. 


18.6 Let the setup be as in §18.5. We say that the projective curve over k defined 
by (1) has a rational point if there is a point (a, 8, y) # 0 in k? such that 


f(a, B, y) = aa? + bp? —y? =0. 


Prove that the following statements are equivalent: 
(i) The projective curve over k defined by (1) has a rational point. 
(ii) b is a norm in the extension k(./a)/k. 
(iii) F/k is purely transcendental, that is, there exists t € F such that F = k(t). 


Hint: Suppose aa? + b— B? = 0, with a, B € k. Parametrize the line joining the 
point (a, B) to the point (x/y, 1/y) of the curve by its slope; that is, set 


pe SOVEY | A REY, LEY 
“ x/y-a@ x-ay “ x+ay 


Chapter 18: Fundamentals of Transcendental Field Extensions 279 


An easy calculation shows that st = a. Then one can find x, y by solving a system 
of linear equations with coefficients in k(s,t) via Cramér’s rule; the result is 


_ 2B-(s+t)a 7 s—t 
~— B(s+t) —2aa’ am B(s+t) —2aa’ 


with a nonzero denominator. 
These formulas become, in the case of the equation x* + y* — 1 = 0 of a circle, 


(2) 


—a rational parametrization of the circle! Discuss also the curve x* + y?+ 1=0, 
which has no rational points over Q, over the field k = Q(). 


18.7 Using (2), parametrize all the rational solutions of the equation x? + y? = 1. 
As an application, prove that the set of solutions (a,b,c) of X? + Y? = Z? in 
natural numbers a,b,c (such triples are called Pythagorean) with a,b,c relatively 
prime and 6 even is parametrized by 


a=m—n’, b=2mn, c=m’ +n’, 
where (m,n) runs over all pairs of relatively prime natural numbers such that m >n 
and mn is even. (Obviously any Pythagorean triple can be reduced to one satisfying 
the stated restrictions.) 


18.8 Let k be a field of characteristic distinct from 2. Lest the special examples 
of curves in §18.5 to §18.7 leave something of a false impression, it should be 
said that for an arbitrary curve defined over k the situation for k-rational points 
may be radically different. For example, replace the polynomial (1) of §18.5 by a 
polynomial of the form 


(3) f(X,Y) =Y?-d(X), 


where d(X) is any separable polynomial over k of degree at least 3, and let 7 be 
the corresponding homogeneous polynomial. A (projective) curve defined over k by 
a such a polynomial f is called hyperelliptic (beware: often this term is restricted 
to the case deg d > 5). 

Ifk = Q and deg d = 5, the curve defined by (3) always has at most finitely many 
rational points. This follows from a general result of Gerd Faltings, one of the most 
significant advances in mathematical research from the last couple of decades. 

We also say something about the case where d has degree 3; the projective 
curve determined by (3) is then called an elliptic curve over k, and it must contain 
at least one rational point over k, since f(0, 1,0) = 0. (When deg d = 4, too, one 
sometimes talks of the curve as being elliptic, but such a curve need not contain 
regular rational points over k. If such a point does exist, the curve can be reduced 
to the case of degree 3 by a change of coordinates.) 
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Again with k = Q, the elliptic curve defined by y? = x? + 7x has the rational 
points (0,1,0) and (0,0, 1), and it can be shown with somewhat more advanced 
techniques that these are the only ones. By contrast, the elliptic curve defined 
by y* = x3 —2 has infinitely many rational points (which moreover form a cyclic 
group); this is all the more remarkable because only two of these points have integer 
coordinates (see §4.5). More broadly, the question whether a given elliptic curve 
over Q possesses finitely or infinitely many rational points illustrates a challenging 
and interesting research area in contemporary mathematics, likely to remain a topic 
of lively investigation for a long time to come. 


18.9 Let K(X1,...,Xn)/K be a purely transcendental extension and E an inter- 
mediate field thereof. Liiroth’s Theorem (see §5.9) says that in the case n = | the 
extension £'/K is purely transcendental as well. Is is natural to ask what can be said 
in the case n > 1, if one assumes additionally that TrDeg(£/K) =n, or equivalently 
that K(X,,..., Xn) /E is finite. 

For n = 2 we have a theorem of Castelnuovo: If K is algebraically closed and 
K(X, X2)/E is finite and separable, then E/K is also purely transcendental. (The 
separability assumption cannot be lifted.) 

The parallel statement for m = 3 is no longer valid (for all this see references 
in Hartshorne, Algebraic Geometry, GTM 52, Springer, New York, 1977). Also 
Ischebeck and others were able to show that even for n = 2 there are counterexamples 
when instead of assuming K algebraically closed one takes K = R as the ground 
field. Compare also the remarks at the end of Chapter 16. 


Chapter 19: Hilbert’s Nullstellensatz 


19.1 Let A be an affine K-algebra, that is, a K-algebra A = K[x1,..., Xn] generated 
by finitely many elements, and let C be an algebraically closed extension of K. 
Prove that there exists a homomorphism of K-algebras K|x1,...,Xn]|—> C. 

(Hint: Consider the kernel a of K[X,,..., Xn] —> K[x1,..., Xn]. Then the state- 
ment can be most easily deduced from Theorem 2; in fact it can be regarded as yet 
another version of the Hilbert Nullstellensatz.) 

Can one still demand in the case of an integral domain K[x,,..., Xn] that among 
specified elements /i,..., f- € K[x1,..., Xn], none should map to 0? (Hint: Con- 
sider K[x1,...,Xn,1/fi,...,1/fr]) 


19.2 Let C be a fixed algebraic closure of K. Prove that the maximal ideals of 
K[X1,..., Xn] are exactly the ideals of the form m(a) = {f € K[X1,..., Xn] | 
F(a) = 0} with a = (a1,...,@n) € C”. If K is algebraically closed, these ideals 
are more explicitly described in the form m(a) = (X; —@1, X2—-2,..., Xn —Qy). 


19.3 Prove that the nilradical JO of an affine K-algebra A = K[x1,X2,...,Xn] 
coincides with the Jacobson radical of A, which is defined as the intersection of all 
the maximal ideals of A. This is a weak form of the Hilbert Nullstellensatz. 

(Hint: Set A = K[X,,...,Xn]/a and take f € K[X),...,X,]. If f is not 
nilpotent, there exists (@1,...,@n) € Nc(a) such that f(a1,...,Qn) #0.) 
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19.4 Prove that if every prime ideal in a commutative ring R with unity is finitely 
generated, R is Noetherian. Hint: Let a be maximal among all nonfinitely generated 
ideals of R. For b € a, consider the ideal a: bR={x € R| xb ea}. 


19.5 Let A/R be an integral extension of integral domains, and let R be integrally 
closed (in its fraction field K). Also let 58 be a prime ideal of A and set p= BNR. 
Prove that every prime ideal q of R contained in p satisfies 


qAy ON R=q. 


Hint: Any B € qAsg has the form 6B = a/s witha €qA andseS =A. The 
minimal equation of a € qA over K then has the form 


a tan ++-+-+a)=0 with a; €q, 


by §16.5. Therefore, if one further assumes that B € R, the minimal equation of 
s =a/B over K is 
Gn-1 n-1 do 
gs" + ——s"" 4---+ ——=0. 
B pn 
But s is integral over R, so all the a,—-;/f' lie in R. If B were not in q, we would 
have s"—€qACpACR. 


19.6 Prove F6*, Krull’s Descent Lemma. Hint: It suffices to show that there is a 
prime ideal Q in As; such that QM R= q. Indeed, by §4.12, Q is of the form 
QAs, for Q a prime ideal of A satisfying Q C PB and QM A = Q; there follows 
g= ONR=QNR. Now, in order to show that such a Q exists, we can’t just use 
the going up theorem of §16.12, since As; is generally not integral over R. But we 
can use the result of §19.5, which suffices because of §16.6. 


19.7 Prove (using the Hilbert Basis Theorem) that the content of Hilbert’s Nullstel- 
lensatz (Theorem 1) can easily be quantitatively strengthened as follows: For every 
ideal a of K[X1,..., Xn] there is a natural number m such that f” € a whenever 
f € K[M,..., Xn] vanishes on the zero set of a. (Actually finding such an m for 
a specified a, described say by a set of generators, is a different matter.) 


19.8 Over an algebraically closed field, a system of homogeneous algebraic equa- 
tions in more unknowns than there are equations always has a nontrivial solution. 
Prove this using the road map below, following H.-J. Nastold: 
Let K be an algebraically closed field, and suppose /1,..., f; are polynomials 
in K[X,,..., Xn] such that N(fi,..., f-) = {0}. Then: 
(a) There exists g € N such that ¥7 € (fi,..., f-) for all i. 


(b) Take g as in (a). If the f; are homogeneous, there exists for each 7 certain 
homogeneous polynomials /;,...,4, in K[X1,..., Xn] such that 
Xfahfit---+hyf, and degh; <q. 


(c) Set A= K[M,..., Xn] and R= K[f{,..., f-]. If the ff are homogeneous, 
the ring extension A/R is finite. (Hint: By (b), any monomial of degree > nq 
can be generated by monomials of lower degree.) 
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(d) If the f; are homogeneous, then r > n. 


At the end of Chapter 27 we shall see what is needed in order to derive this 
theorem by means of a simple dimensionality argument. 


19.9 (a) Let a be any ideal of a commutative ring R with unity, and let p1,p2,..., py 
be prime ideals of R such that 


aC p; Up2 U---Up,. 


Prove that there exists i such that a C p;. 

Hint: Let r be minimal with the property that a is contained in a union as 
above. For 1 < j <r, there exists aj €aMp; such that a; ¢ p; for all i F /. 
Now, if r > 1, consider the element a,a2...d,—1 + a+. 


(b) Deduce from this that, if V,W,,..., W, are algebraic K-sets in C” with the 
W; irreducible and none of them contained in V, there is f in K[X1,..., Xn] 
such that f vanishes on V but not on any of the W;. 


19.10 Let V be any algebraic K-set of C”, where C is assumed algebraically closed. 
Prove that, for each 1 <i <n-+1, there exist polynomials /1,..., f; and algebraic 
K-sets Z,,...,Z; of C” such that 


(1) NAO: ANG) =VUZ;, with dim Z; <n—i. 


Hint: We can start by assuming that there is some nonzero /f; that vanishes on V. 
Then N( fi) = V U Z;, where Z, is the union of all the K-components of N(/1) 
that don’t already lie in V. If Z; 4 ©, there is by §19.9 some fz that vanishes on V 
but not on any K-component of Z;. Then N(f,) NN (f2) = VU (Z; AN(f2)), and 
by (30) we have dim(Z, AN(f2)) <dim Z; —1 < (n—1) —1 =n—2. Continuing 
in this way leads to the assertion. 

In the case i = n + 1, equation (1) amounts to Kronecker’s result given in F9. 


19.11 Round off §19.2 by proving that every maximal ideal a of K[X1,..., Xn] 
can be generated by n irreducible polynomials /f1,..., fn. (As remarked at the end 
of Chapter 19, it is even true that every reduced ideal of K[X1,..., Xn] can be 
generated by n polynomials, but the proof cannot be supplied so readily as for the 
special case of this exercise.) 

Hint: By §19.2, such an a is the vanishing ideal of a point a = (a@,...,@,) in 
C”. Now choose f/f, in K[X1,..., Xn—1][Xn] such that f,(a1,...,Q@n—1, Xn) is the 
minimal polynomial of a» over the field K[a1,...,@,-1]. Take f € a. Division 
with rest over K[X1,..., Xn—-1] yields f = qf, +1, with some polynomial r all 
of whose coefficients lie in the kernel a,—-; of K[X1,..., Xn—-1] > A[oi,...,Q@n—1]. 
By induction we can assume that a,—1 is generated by fi,..., fn-1. Then f lies 


in (fi1,---. fn—-1, In): 


19.12 Prove the following generalization of F4: Every reduced ideal a in a Noethe- 

rian ring is an intersection of finitely many prime ideals; if we insist that none of 

these ideals be contained in one another, they are uniquely determined up to order. 
Hint: Use the proof of F4 for guidance. Note that Voce = Jone. 


Index of Notation 


N = {1,2,3,...} set of natural numbers 

Z ring of integers 

Q field of rational numbers 

R field of real numbers 

C =R? complex plane, field of complex numbers 
|M | cardinality of a set M@ 

JZ (some) square root of z € C 

Vy (some) n-th root of y € K (in C) 

AM set of numbers constructible from M 
Li(M), Ci(M) 

K(A), K(a1,...Qn) field obtained by adjunction 
E:K=[E: K] degree of a field extension E/K 
Mipox (a) minimal polynomial of a over K 
a:K=K(a):K degree of a over K 

E/K field extension 

K{a] ring obtained by adjunction 

Q* field of all algebraic numbers (in C) 

L,L2 composite of fields 

K(X] polynomial ring on one variable Y over K 
ker g kernel of a ring homomorphism 

R/T quotient ring modulo an ideal J of R 
a=bmodI 

(a) = Ra principal ideal of R generated by ae R 


204 


284 Index of Notation 


R/a, K{x]/f 

Frac(R) fraction field of an integral domains R 

K(X) field of rational functions in one variable over K 
char(K) characteristic of a field K 

Fp =Z/pZ 

a|b a divides b (in a commutative ring R) 


R* group of units of a ring R with unity 


a=b a and b are associated (in a commutative ring with unity) 
gcd, lcm greatest common divisor, least common multiple 
Wr (a) exponent of the prime factor z in a 


Wz : K[X] — ZU {00} 

0: E\/K — E2/K homomorphism of field extensions 
KG group algebra of G over K 

@je, Ai tensor product of K-algebras A;, i € J 


A®xkB=A®QB tensor product of K-algebras A, B 
V@xrWw=VeEwW tensor product of K-modules V, W 

a1 @d2@---@an, @j ai, X@y tensors 

K(X] polynomial ring in the variables ¥ € X over K 
K[X1,..., Xn] polynomial ring in n variables over K 

la: K]y =[K(q@): K]s separable degree of a over K 
G(E,/K, E2/K) set of 0: £1 /K —> E2/K 

G(E/K) = G(E/K, E/K) 

[E : K]s separable degree of algebraic extension E/K 

Es separable closure of K in EF 

[E: K]j,=E: Es inseparable degree of E'/K 

KP subfield of p-th powers in a field K of characteristic p 
E® fixed field of the group G of automorphisms of a field E 
G(E/K) Galois group of a Galois extension E'/K 

D(f) discriminant of a normalized polynomial f 

Fg =Fpa finite field with g = p” elements 

<a> subgroup of G generated by a € G 

ord(a) order of group element a 


y(n) Euler totient function 
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(Z/n)* = (Z/nZ)* group of prime residue classes mod n 
W(K) group of roots of unity of K 

W,(K) group of n-th roots of unity of K 

bn primitive n-th root of unity 

F(X) n-th cyclotomic polynomial 

Og Frobenius automorphism (of an extension of finite fields) 
S(M) group of all bijection of a set M@ 

Sn symmetric group in n elements 

G/H set of left cosets oH, 0 €G 

G:H index of a subgroup H in G 

Gx stabilizer of x 

ZG center of group G 

ZG(t) centralizer of t in G 

H<G His a subgroup of G 

HAG His anormal subgroup of G 

H<«G A is a characteristic subgroup of G 

NGgH normalizer of H in G 

Sylp(G) set of Sylow p-groups of G 

Np number of Sylow p-groups of G 

i? SG Xo 

Fx k-th Fermat number 

d(p) square-free integer d # 1 such that /d € Q(é ») 
pra (DO, 

(5) Legendre symbol 


oF 


lim Gi projective limit of a projective system (G;)je7 
Trv/K(a) = Trace (ay/K) trace 

Nu/x (a) = det(ay/x) norm 

Puyjx(a;X) characteristic polynomial 
Tr4/K,NajxK,Pa/K ditto for K-algebras A 

Tre/K, NEsK trace and norm for field extensions E/K 
§(x) =x? —-—x 

G* = Hom (G,C%*) group of characters of G 
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G** = (G*)* 
K*" group of n-th powers in K* 

VA set of all n-th roots of elements of A 

Ap p-component of an abelian group A 

e(G) exponent of a group G 

<M> subgroup of G generated by M CG 

AT torsion group of an abelian group A 

S1,..+,5n elementary symmetric functions in X1,..., Xn 


p(a1,...,dqa) cycle of length d in Sy 


sgn signature map on S;, 
An alternating group in 1 elements 
A/R ring extension 


H(M) algebraic closure of K(M) in E 

Trgd(E/K) transcendence degree of extension E/K 
N(M) set of roots of MC K[M,..., Xn] in C” 

I(N) ideal of subset N CC” in K[M,... Xn] 

Ja radical of the ideal a of R 

V0 nilradical of R 

dim A Krull dimension of a commutative ring A 

dim V = dim K[V] dimension of the algebraic K-set V 
SR ring of fractions of R relative S 


Ry localization of R at the prime ideal p 

PGL(n, K) = GL(n, K) / K* projective linear group 

Z 

Zp 

L(n) Mobius function 

G’ commutator group of the group G 

Gg” n-th commutator group of G 

Dex (f1,.--+ Bn) discriminant of the basis 61,...,B, of E/K 
Ox ring of algebraic integers of a number field K 


Dx discriminant of an algebraic number field K 
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213 
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178, 244 
abelian see also under Galois group 


extension 151 
field extension 152 


group 149 
elementary 268 
finite 154, 160 
free 161 


number field 259 
adjunction 6 
of asquare root 7 
affine 
K-algebra 224 
algebraic set 217 
coordinate ring 224 
subgroup of S(Fp) 187 
variety 222 
Alfes, Rainer ii 
algebra over K 11 
algebraic 
K-set 217 
closure 
absolute 57 
inanextension 19 
extension 17 
geometry 217 
independence of field homomorphisms 
261 
number 15 
number field 259, 275 
number theory 113 
over K 15 


algebraically (in)dependent 209 
algebraically closed 56,234 
alternating group 182 
angle see trisection 
Artin, Emil (1898-1962) 
120, 147, 261, 268 
associated element 34 
automorphism 
group of a field extension 
inner 96 
axiom of choice 39 


55, 113, 118, 


65, 76, 246 


Bertrand, Joseph Louis Frangois (1822- 
1900) 271 

Bertrand’s Theorem and Postulate 271 

Bézout, Etienne (1730-1783) 235 

Bézout ring 235 

BI-Wissenschaftsverlag ii 

bilinear form 134 

bilinear map 150 

binomial equation 143 

Borevich, Zenon I. (1924-1995) 260 

Brandis, Albrecht 232 


Capelli, Alfredo (1855-1910) 267 
Cardano, Geronimo (1501-1576) 
Carlitz, Leonard (1907-1999) 260 
Castelnuovo, Guido (1865-1952) 280 
casus irreducibilis 186 

Cauchy, Augustin (1789-1857) 
Cayley, Arthur (1821-1895) 94 
center of a group 97 

central subring 191 

centralizer 97 


165, 185 


101, 154 
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chain (in ordered set) 64 
character 149 
characteristic of a field 30 
characteristic subgroup 186, 254 
characteristic-simple group 254 
Chebyshev, Pafnuti L. (1821-1894) 271 
Chinese Remainder Theorem 42, 88 
choice, axiom of 39 
circle see also quadrature 

n-section of 3,9,51, 104,258 

rational parametrization 279 
CM-field 275 
Coers,H. ii 
Cohen, I.S. 274 
commutative ring with unity 33, 133 
commutator subgroup 268 
compass see ruler and compass 
complex multiplication field 275 
p-component 155 
composite of fields 20 


conjugate 
group element 97 
over K 65 
subgroup 98 
constructibility see ruler and compass 


constructible number 2,5 
content of a polynomial 48 
coset 95 

crossed homomorphism 263 
cube see under doubling 


cubic equation 165,185 
cycle 179 
cyclic 


extension 144 
Galois group 81, 92, 139, 143 
permutation 179 
cyclotomic 
field 259 
polynomial 90,91, 250 
Daldrop, Hans _ ii 
Davenport, Harold (1907-1969) 260 
decomposition see factorization 
Dedekind, Richard (1831-1916) 
117, 190, 200, 242 
Dedekind reduction principle 190 
degree 
formula 12 
of afield extension 11 


i, 10,55, 


of anelement 17 
Delian problem 2, 27 
derivative 69 
descent lemma 228, 281 
determinant of group 262 
Deuring, Max (1907-1984) 123 
dimension 
ofaring 225 
of an algebraic set 224 
directly indecomposable modules 123 
directory of primes 39 
Dirichlet, Peter Gustav Lejeune (1805-1859) 
251 
discriminant 
of algebraic number field 275 
of basis 274 
of polynomial 
divisible, divisor 
double coset 98 
doubling of the cube 
Drinfeld, G.I. 203 
dual of group 149 


81, 165, 183, 272, 274 
33, see also factorization 


2,9,27 


Eisenstein, Gotthold (1823-1852) 50, 241 
Eisenstein criterion 50,53, 73,80, 189 
elementary 

abelian group 268 

symmetric functions 
elliptic curve 236,279 
Engesser, Hermann ii 
Epkenhans, Hans | ii 


174, 177, 195, 273 


Euclid (ca. 365-300 B.C.) 26, 36, 38, 236, 
252 
Euclidean 
algorithm 38,236 
domain 36,38, 238 
valuation 36,38 


Euler g-function see _totient 
Euler, Leonhard (1707-1783) 
108, 113 

Euler-Lagrange Theorem 95 
Euler’s criterion 108 
exact sequence 150 
exponent 

ofagroup 155 

of an abelian extension 152 

of an abelian group 149 
extension 

algebraic 17 


27, 105, 


field 6,15 

finite 17,193 

integral 193 

normal 59 

ring 191 

simple 21,31 

solvable by radicals 166 
transcendental 17 


factorization 37,39,58 
Faltings, Gerd (1954—) 279 
Fermat, Pierre de (1607-1665) 
238 
Fermat number 258 
Ferro, Scipione (1465-1526) 165 
field 
algebraically closed 56, 234 
complex multiplication 275 
cyclotomic 259 
extension see under extension 
finite 71,83, 86,92, 260 
fixed 75 
Hilbertian 190 
intermediate 19 
of n-th roots of unity 88 
of algebraic numbers 19 
of constructible numbers 5 


105, 236, 


of fractions see fraction field 

of rational functions 29,53, 173, 190 
perfect 73 

prime 31 


pythagorean 263 
quadratic 259 
quadratically closed 5 
splitting 58 
finite 
abelian group 154 
field 71, 83,86, 92, 260 
field extension 17 
ring extension 193 
fixed field 75 
formal fractions 235 
four-group 182 
fraction field 28,235 
fractions, ring of 235 
free abelian 161 
Frenicle de Bessy (1605-1675) 105 
Frobenius, Georg (1849-1917) 92, 200, 
255, 257, 259, 262 
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Frobenius automorphism 92 
Fundamental Homomorphism Theorem 23 
Fundamental Theorem 
of Algebra 27, 36, 189 
of Galois theory 79, 128 
on Symmetric Functions 177, 195,273 
Galois, Evariste (1811-1832) 
187, 249 
extension 76,246 
group 76,79, see also finite, cyclic, 
abelian 
abelian 151,259 
cyclic 81,92, 197 
finite 263 
modulo primes 197 
of certain polynomials 189 
solvable 173 
theory 55, 76,79, 115, 128 
inverse problem 178,202 
Gauss, Carl-Friedrich (1777-1855) 1, 22, 
27, 45, 49, 52, 89, 104, 113 
Gauss’s Lemma 49, 196 
Gauss’s Theorem 195 
Gaussian 
integers 238 
periods 257 
sums 110 
ged 34 
general polynomial 175 
generators 161 
Goethe, Johann Wolfgang von (1749-1832) 
i 
going up theorem 274 
greatest common divisor 34 


i, 76, 166, 


group see also abelian, finite, Galois 
action 93 
transitive 80 
algebra 60 
determinant 262 
theory 93 


Hartshorne, Robin 217,280 

heptagon, regular 51,52 

heptakaidecagon 52,258 

Hermite, Charles (1822-1901) 52, 207 

Hilbert, David (1862-1943) 140, 190, 263 
Basis Theorem 221, 280,281 
irreducibility theorem 190,201 
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Nullstellensatz 219,280, 281 

Theorem Ninety 140, 263 
Hilbertian field 190 
homogeneous polynomial 242 
homomorphism 

crossed 263 

of K-algebras 55 

of field extensions 55 

theorem 23 
Huppert, Bertram (1927—) 257 
hyperelliptic curve 279 
hypersurface 229 


ideal 22 

maximal 41 

ofasetinC” 217 

prime 41 

principal 34 

product of-s 42,43 

reduced 219 

sum of-s 43 
index of asubgroup 95 
infinite Galois extensions 128 
inner automorphism 96 
inseparable 66 

degree 73,246 

purely 71 
integer, Gaussian 238 
integral 

basis 275 

closure 194 

domain 16,41 

over R 191 

ring extension 193 
integrality equation 191 
integrally closed 194, 195 
intermediate field 19 
inverse Galois theory 178, 202 
inversion formula 264 
irreducibility criterion 241 
irreducible 37 

K-component 223 

algebraic set 223 

polynomial 26, 48 

radicals 171 
Ischebeck, Friedrich G. 280 


Jacobi, Carl Gustav Jacob (1804-1851) 
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Jacobi symbol 112 
Jensen, Christian U. 202 


K-algebra 242 

K-conjugate 65 

K-homomorphism 55 

Klein, Felix (1849-1925) 182 

Kronecker, Leopold (1823-1891) 27,58, 
113, 188, 230, 234, 259, 275, 282 

Krull, Wolfgang (1899-1971) 123, 128, 
225, 228, 274, 281 

Krull topology 128 

Krull’s descent lemma 228, 281 

Kummer, Eduard (1810-1893) 152,276 

Kummer extension 152 

Kummer theory 153, 265, 268 

Kummer’s Lemma 276 


Lagrange resolvent 145 
Lagrange, Joseph Louis (1736-1813) 27, 
95, 145 
Lang, Serge (1927—) 190 
Icm, least common multiple 34 
Ledet, Arne 202 
Legendre, Adrien Marie (1752-1833) 107, 
113 
Legendre symbol 107, 149 
Leibniz, Gottfried Wilhelm (1646-1716) 
259, 262 
length 
of cycle 179 
of orbit 94 
Levy, Silvio ii 
Lindemann, Ferdinand (1852-1939) 15, 
52, 207 
linear algebra i, 11, 133, 242 
linear independence of field homomorphisms 
117 
local ring 237,274 
localization 235,237 
at prime ideal 237 
Lorenz, Falko 113, 259, 263 
Liiroth, Jakob (1844-1910) 240, 247, 280 
Liitkebohmert, Werner ii 


Malle, Gunther 202 

Matsumura, Hideyuki (1930-—) 225 
Matzat, B. Heinrich (1945—) 202 
maximal element 63 


maximal ideal 41,241 
Mersenne, Marin (1588-1648) 105,258 
metacyclic see solvable 


Miller, George Abraham (1863-1951) 255 


minimal polynomial 17,31 

Mobius, August Ferdinand (1790-1868) 
264 

MGbius function, inversion formula 264 

module over a ring (or field) 62, 192 

monoid algebra 60 


multiplicative 
function 237,264 
group 24 
of finite field 83 
map 62, 152 


subset 235,237, 242, 272 


Nastold, Hans-Joachim (1929-2004) _ ii, 
281 
Neukirch, Jiirgen (1937-1997) 113 
Newton, Sir Isaac (1643-1727) 271 
Newton’s formulas 271 
nilpotent group 101,256 
nilradical 218, 237 
Noether, Emmy (1882-1935) 201,263 
isomorphism theorem for groups 170 
Noether normalization theorem 214 
Noetherian ring 221 
norm 133 
normal 

basis 120, 122, 262 

closure 60,76 

field extension 59,76 

series 268 

subgroup 24,77, 253-254 
normalized polynomial 16 
normalizer 98 
Nullstellensatz 219,281 
number theory 113,264 


Opolka, Hans 105 
orbit 94 
formula 96 
order of a group element 85 
ordered set 63 


p-group 97, 154 
p-subgroup 99 
pairing 150 
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parity see signature 
pentagon, regular 1,9 
perfect field 73 
permutation 75, 178-182 

cycle decomposition 179 
mz, transcendence of 15,52, 203 
gr-adic valuation 40,47 
polygon, regular 1,3,9,51, 104, 258 
polynomial algebra 210 
polynomial ring 63 
prime 

element 37 

factorization 39,45 

field 31 

ideal 41 

numbers 39 

polynomial 26 

residue classes 88, 250 
primitive 

element 21 

theorem 78, 244 

polynomial 48 

root 86 

root of unity 88 
principal ideal 25, 34,41 
principal series 268 
product of ideals 42, 43 
product tensor 61 
projective linear group 247 
projective system, projective limit 126 
pure equation 143 
purely inseparable 71,246 
purely transcendental 213 
pythagorean field 263 
pythagorean triple 279 


quadratic 

equation 165 

field 259 

reciprocity law 107,110, 111, 258 
quadratically closed field 5 
quadrature of the circle 3,9, 15 
quotient 28 

homomorphism 23 

ring 23 


radical see also solvable by —-s 
extension 166 
of anideal 218 
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rational functions 29,53, 173, 190 
rational parametrization of circle 279 
reciprocity law 107, 111, 264 
reduced ideal 219 
relatively prime 42, 43 
Remak, Robert (1888-1942) 123 
residue-class ring 23 
resolvent 145 
ring 
extension 191 
of formal fractions 235 
polynomial 63 
simple 41 
n-throots 143 
root of unity 88 
Ruffini, Paolo (1765-1822) 178 
ruler and compass 1, 8, 51, 103, 258 


Sch6nemann, Theodor 241 

Scharlau, Winfried (1940—) 105 

Schmidt, Otto Juljewitsch (1891-1956) 
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Scholz, Arnold (1904-1942) 178 

Schopohl, Ralf ii 

Schreier, Otto (1901-1929) 147, 268 

Schulze-Relau, Hubert — ii 

Schur, Issai (1875-1941) 250 

Seidenberg, Abraham (1916-1988) 274 


separable 
closure 71 
degree 66 


element 66 
polynomial 69, 143 
17-gon 52,258 
Shafarevich, Igor R. (1923—) 178, 260 
signature map 181, 269 
simple 
field extension 21,31,78 
group 182,254 
ring 41 
size of orbit 94 
solvable 
by irreducible radicals 171 
by radicals 1, 166 
equation of prime degree 187,271 
group 167, 173, 268,272 
Speiser, Andreas (1885-1970) 263 
Spektrum Akademischer Verlag i 
Spencer, Mark ii 


splitting field 58 

Springer-Verlag ii 

square-free integer 50 

stabilizer 96 

Steinitz, Ernst (1871-1928) 57 

Stevin, Simon (1548-1620) 26 

Storch, Uwe 230 

structure theorem for finite abelian groups 
160 

substitution homomorphism 21, 63 

successive see under adjunction 

sum of ideals 43 

Swan, RichardG. 201 

Sylow, Ludvig (1832-1918) 99, 101 

Sylow group (or p-subgroup) 99, 255 

Sylow’s Theorems 100 

symmetric functions 174, 177, 195, 273 


Tartaglia, Nicolo (1500-1557) 165 

tensor product 242 
of K—algebras 61 

Theorem = see also fundamental 
90 of Hilbert 140, 263 
Artin-Schreier 147 
Bertrand 271 
Capelli 267 
Cauchy 101, 154 
Cayley 94 
Chinese Remainder 42 
Dirichlet 251 
Euler-Lagrange 95 
Hermite—Lindemann 207 
Hilbert’s basis 221, 280, 281 
Hilbert’s irreducibility 190,201 
Kronecker 27 
Kronecker—Weber 259 
Liiroth 240, 247, 280 
Lindemann 15 
Noether normalization 214 
Nullstellensatz 219, 280, 281 
on normal bases 120, 122, 262 
primitive element 78, 244 
structure of finite abelian groups 160 
Sylow 100 
Translation 115 
Wilson 259 

topological group 127 

torsion group 154 

torsionfree group 162 


totally real 276 
totient function 88, 264 
trace 133 
transcendence 

basis 211 

degree 213,231 

of z 15,52, 203 
transcendental 

extension 17 

number 15 

over K 15 
transitive group action 80, 94 
transitivity 

of algebraicness 19 

of separability 69 
translation 94 
Translation Theorem 115 
transposition 181, 189 
trisection of the angle 1, 2,9, 52 
Tschebyschev see Chebyshev 


UFD 37,45, 195 
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universal property 21 


valuation 36, 38,40, 47 
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Weber, Heinrich (1842-1913) 259 

Weierstrass, Karl (1815-1897) 206 

Wielandt, Helmut (1910-2001) 255 

Wikipedia 64 

Willhoft, Oda ii 

Wilson, John (1741-1793) 259 

Witt, Ernst (1911-1991) 149, 268 


Yui, Noriko 202 
Zimmermann, Nadja 264 


Zorn, Max August (1906-1993) 64 
Zorn’s Lemma _ 63 


